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EXPRESSION VECTORS FOR STIMULATING AN IMMUNE 
RESPONSE AND METHODS OF USING THE SAME 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application claims the benefit of 09/078,904, filed May 13, 1998, and 

60/085,751, filed May 15, 1998, both herein incorporated by reference in their entirety. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

10 This invention was made with government support imder NIK Grant No. 

AI-42699-01, NIH Grant No. AI38584-03, and NIH Contract No. NOl-AI-45241. The 

Government has certain rights in this invention. 

FIELD OF THE INVENTION 
15 The present invention relates to nucleic acid vaccines encoding multiple 

CTL and HTL epitopes and MHC targeting sequences. 

BACKGROUND OF THE INVENTION 
Vaccines are of fundamental importance in modem medicine and have 
20 been highly effective in combating certain human diseases. However, despite the 

successful implementation of vaccination programs that have greatly Umited or virtually 
eliminated several debiUtating human diseases, there are a number of diseases that affect 
miUions worldwide for which effective vaccines have not been developed. 

Major advances in the field of immunology have led to a greater 
25 understanding of the mechanisms involved in the immune response and have provided 
insights into developing new vaccine strategies (Kuby, Immunology, 443-457 (3rd ed., 
1997), which is incorporated herein by reference). These new vaccine strategies have 
taken advantage of knowledge gained regarding the mechanisms by which foreign 
material, termed antigen, is recognized by the immune system and eliminated fi-om the 
30 organism. An effective vaccine is one that elicits an immime response to an antigen of 
interest. 

Speciahzed cells of the immune system are responsible for the protective 
activity required to combat diseases. An immune response involves two major groups of 
cells, lymphocytes, or white blood cells, and antigen-presenting cells. The purpose of 
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these immune response cells is to recognize foreign material, such as an infectious 
organism or a cancer cell, and remove that foreign material from the organism. 

Two major types of lymphocj^es mediate different aspects of the immune 
response. B cells display on their cell surface specialized proteins, called antibodies, that 
5 bind specifically to foreign material, called antigens. Effector B cells produce soluble 
forms of the antibody,' which circulate throughout the body and function to eliminate 
antigen from the organism. This branch of the inunune system is known as the humoral 
branch, Memory B cells function to recognize the antigen in future encounters by 
continuing to express the membrane-bound form of the antibody. 

10 A second major type of lymphocyte is the T cell. T cells also have on their 

cell surface specialized proteins that recognize antigen but, in contrast to B cells, require 
that the antigen be bound to a specialized membrane protein complex, the major 
histocompatibility complex (MHC), on the surface of an antigen-presenting cell. Two 
major classes of T cells, termed helper T lymphocytes ("HTL") and cytotoxic T 

15 lymphocytes ("CTL"), are often distinguished based on the presence of either CD4 or 
CDS protein, respectively, on the cell surface. This branch of the immune system is 
known as the cell -mediated branch. 

The second major class of immune response cells are cells that function in 
antigen presentation by processing antigen for binding to MHC molecules expressed in 

20 the antigen presenting cells. The processed antigen bound to MHC molecules is 

transferred to the surface of the cell, where the antigen-MHC complex is available to bind 
to T cells. 

MHC molecules can be divided into MHC class I and class II molecules 
and are recognized by the two classes of T cells. Nearly all cells express MHC class I 

25 molecules, which function to present antigen to cytotoxic T lymphocytes. Cytotoxic T 
lymphocytes typically recognize antigen bound to MHC class I. A subset of cells called 
antigen-presenting cells express MHC class II molecules. Helper T lymphocytes 
typically recognize antigen bound to MHC class II molecules. Antigen-presenting cells 
include dendritic cells, macrophages, B cells, fibroblasts, glial cells, pancreatic beta cells, 

30 thymic epithelial cells, thyroid epithelial cells and vascular endothelial cells. These 

antigen-presenting cells generally express both MHC class I and class II molecules. Also, 
B cells function as both antibody-producing and antigen-presenting cells. 

Once a helper T lymphocyte recognizes an antigen-MHC class 11 complex 
on the surfaceof an antigen-presenting cell, the helper T lymphocyte becomes activated 
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and produces growth factors that activate a variety of cells involved in the immune 
response, including B cells and cytotoxic T lymphocytes. For example, under the 
influence of growth factors expressed by activated helper T lymphocytes, a cytotoxic T 
lymphocyte that recognizes an antigen-MHC class I complex becomes activated. CTLs 
5 monitor and eliminate cells that display antigen specifically recognized by the CTL, such 
as infected cells or tumor cells. Thus, activation of helper T lymphocytes stimulates the 
activation of both the humoral and cell-mediated branches of the immune system. 

An important aspect of the immune response, in particular as it relates to 
vaccine efficacy, is the manner in which antigen is processed so that it can be recognized 

10 by the specialized cells of the immune system. Distinct antigen processing and 

presentation pathways are utilized. The one is a cytosolic pathway, which results in the 
antigen being bound to MHC class I molecules. An alternative pathway is an 
endoplasmic reticulum pahtway, which bypasses the cytosol. Another is an endocytic 
pathway, which results in the antigen being bound to MHC class II molecules. Thus, the 

15 cell surface presentation of a particular antigen by a MHC class II or class I molecule to a 
helper T lymphocyte or a cytotoxic T lymphocyte, respectively, is dependent on the 
processing pathway for that antigen. 

The cytosolic pathway processes endogenous antigens that are expressed 
inside the cell. The antigen is degraded by a specialized protease complex in the cytosol 

20 of the cell, and the resulting antigen peptides are transported into the endoplasmic 
reticulum, an organelle that processes cell surface molecules. In the endoplasmic 
reticulum, the antigen peptides bind to MHC class I molecules, which are then 
transported to the cell surface for presentation to cytotoxic T lymphocytes of the immune 
system. 

25 Antigens that exist outside the cell are processed by the endocytic 

pathway. Such antigens are taken into the cell by endocytosis, which brings the antigens 
into specialized vesicles called endosomes and subsequently to specialized vesicles called 
lysosomes, where the antigen is degraded by proteases into antigen peptides that bind to 
MHC class n molecules. The antigen peptide-MHC class II molecule complex is then 

.30 transported to the cell surface for presentation to helper T lymphocytes of the immune 
system. 

A variety of factors must be considered in the development of an effective 
vaccine. For example, the extent of activation of either the humoral or cell-mediated 
branch of the immune system can determine the effectiveness of a vaccine agauist a 
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particular disease. Furthermore, the development of immunologic memory by inducing 
memory-cell formation can be important for an effective vaccine against a particular 
disease (Kuby, supra). For example, protection from infectious diseases caused by 
pathogens with short incubation periods, such as influenza virus, requires high levels of 
5 neutralizing antibody generated by the humoral branch because disease symptoms are 
already underway before memory cells are activated. Alternatively, protection from 
infectious diseases caused by pathogens with long incubation periods, such as polio virus, 
does not require neutralizing antibodies at the time of infection but instead requires 
memory B cells that can generate neutralizing antibodies to combat the pathogen before it 

10 is able to infect target tissues. Therefore, the effectiveness of a vaccine at preventing or 
ameliorating the symptoms of a particular disease depends on the type of immime 
response generated by the vaccine. 

Many traditional vaccines have relied on intact pathogens such as 
attenuated or inactivated viruses or bacteria to elicit an irranune response. However, 

15 these traditional vaccines have advantages and disadvantages, including reversion of an 
attenuated pathogen to a virulent form. The problem of reversion of an attenuated 
vaccine has been addressed by the use of molecules of the pathogen rather than the whole 
pathogen. For example, immunization approaches have begun to incorporate 
recombinant vector vaccines and synthetic peptide vaccines (Kuby, supra). Recently, 

20 DNA vaccines have also been used (Donnelly et al, Annu. Rev. Immunol. 15:617-648 
(1 997), which is incorporated herein by reference). The use of molecules of a pathogen 
provides safe vaccines that circumvent the potential for reversion to a virulent form of the 
vaccine. 

The targeting of antigens to MHC class 11 molecules to activate helper T 
25 lymphocytes has been described using lysosomal targeting sequences, which direct 

antigens to lysosomes, where the antigen is digested by lysosomal proteases into antigen 
peptides that bind to MHC class 11 molecules (U.S. Patent No. 5,633,234; Thomson et al, 
J. Virol. 72:2246-2252 (1998)). It would be advantageous to develop vaccines that 
deUver multiple antigens while exploiting the safety provided by administering individual 
30 epitopes of a pathogen rather than a whole organism, hi particular, it would be 
advantageous to develop vaccines that effectively target antigens to MHC class II 
molecules for activation of helper T lymphocytes. 

Several studies also point to the crucial role of cytotoxic T cells in both 
production and eradication of infectious diseases and cancer by the immune system 
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(Byrne et al, J. Immunol. 51:682 (1984); McMichael et at, K Engl. J. Med. 309:13 
(1983)). Recombinant protein vaccines do not reliably induce CTL responses, and the 
use of otherwise inxmunogenic vaccines consisting of attenuated pathogens in humans is 
hampered, in the case of several important diseases, by overriding safety concerns. In the 
5 case of diseases such as HIV, HBV, HCV, and malaria, it appears desirable not only to 
induce a vigorous CTL response, but also to focus the response against highly conserved 
epitopes in order to prevent escape by mutation and overcome variable vaccine efficacy 
against different isolates of the target pathogen. 

Induction of a broad response directed simultaneously against multiple 

10 epitopes also appears to be crucial for development of efficacious vaccines. HIV 
infection is perhaps the best example where an infected host may benefit from a 
multispecific response. Rapid progression of HIV infection has been reported in cases 
where a narrowly focused CTL response is induced whereas nonprogressors tend to show 
a broader specificity of CTLs (Goulder et al, Nat. Med. 3:212 (1997); Borrow et al., Nat. 

15 Med. 3:205 (1997)). The highly variable nature of HIV CTL epitopes resulting from a 
highly mutating genome and selection by CTL responses directed against only a single or 
few epitopes also supports the need for broad epitope CTL responses (McMichael et al, 
Annu. Rev. Immunol 15:271 (1997)). 

One potential approach to induce multispecific responses against 

20 conserved epitopes is immunization with a rainigene plasmid encoding the epitopes in a 
string-of-beads fashion. Induction of CTL, HTL, and B cell responses in mice by 
minigene plasmids have been described by several laboratories using constructs encoding 
as many as 11 epitopes (An et al, J. Virol 71:2292 (1997); Thomson et al, J. Immunol. 
157:822 (1996); Whitton et al, J. Virol. 67:348 (1993); Hanke et al. Vaccine 16:426 

25 (1998); Vitiello et al. Eur. J. Immunol 27:671-678 (1997)). Mmigenes have been 

delivered in vivo by infection with recombinant adenovirus or vaccinia, or by injection of 
purified DNA via the intramuscular or intradermal route (Thomson et al, J. Immunol 
160:1717 (1998); Toes et al, Proc. Natl Acad ScL USA 94:14660 (1997)). 

Successfiil development of minigene DNA vaccines for human use will 

30 require addressing certain fimdamental questions dealing with epitope MHC affinity, 
optimization of constructs for maximum in vivo immunogenicity, and development of 
assays for testing in vivo potency of multi-epitope minigene constructs. Regarding MHC 
binding affinity of epitopes, it is not currently known whether both high and low affinity 
epitopes can be included within a single minigene construct, and what ranges of peptide 
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affinity are pennissible for CTL induction in vivo. This is especially important because 
dominant epitopes can vary in their affinity and because it might be important to be able 
to deliver mixtures of dominant and subdominant epitopes that are characterized by high 
and low MHC binding affinities. 
5 With respect to minigene construct optimization for maximum 

immunogenicity in vivo, conflicting data exists regarding whether the exact position of 
the epitopes in a given construct or the presence of flanking regions, helper T cell 
epitopes, and signal sequences might be crucial for CTL induction (Del Val et al. Cell 
66:1145 (1991); Bergmann et al, J. Virol. 68:5306 (1994); Thomson et al, Proc. Natl 

10 Acad. Sci. USA 92:5845 (1995); Shirai et al.. J. Infect. Dis. 173:24 (1996); Rahemtulla et 
al. Nature 353:180 (1991); Jennings et al, Cell Immunol 133:234 (1991); Anderson et 
al, J. Exp. Med. 174:489 (1991); Uger et al, J. Immunol. 158:685 (1997)). Finally, 
regarding development of assays that allow testing of human vaccine candidates, it should 
be noted that, to date, all in vivo immunogenicity data of multi-epitope minigene plasmids 

1 5 have been performed with murine class I MHC-restricted epitopes. It would be 

advantageous to be able to test the in vivo immunogenicity of minigenes containing 
human CTL epitopes in a convenient animal model system. 

Thus, there exists a need to develop methods to effectively deUver a 
variety of HTL (helper T lymphocyte) and CTL (cytotoxic T lymphocyte) antigens to 

20 stimulate an immune response. The present invention satisfies this need and provides 
related advantages as well. 

SUMMARY OF THE INVENTION 
The invention therefore provides expression vectors encoding two or more 

25 HTL epitopes fiised to a MHC class II targeting sequence, as well as expression vectors 
encoding a CTL epitope and a universal HTL epitope fused to an MHC class I targeting 
sequence. The HTL epitope can be a universal HTL epitope (also referred to as a 
universal MHC class 11 epitope). The invention also provides expression vectors 
encoding two or more HTL epitopes fused to a MHC class II targeting sequence and 

30 encoding one or more CTL epitopes. The invention additionally provides methods of 

stimulating an immune response by administering an expression vector of the invention in 
vivo, as well as methods of assaying the human immunogenicity of a human T cell 
peptide epitope in vivo in a non-human mammal. 
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In one aspect, the present invention provides an expression vector 
comprising a promoter operably linked to a first nucleotide sequence encoding a major 
histocompatibility (MHC) targeting sequence fused to a second nucleotide sequence 
encoding two or more heterologous peptide epitopes, wherein the heterologous peptide 
5 epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a universal 
HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 
immune response in vivo comprising administering to a mammalian subject an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 

10 major histocompatibility (MHC) targeting sequence fused to a second nucleotide 

sequence encoding two or more heterologous peptide epitopes, wherein the heterologous 
peptide epitopes comprise two HTL peptide epitopes or a CTL peptide epitope and a 
universal HTL peptide epitope. 

In another aspect, the present invention provides a method of inducing an 

15 immune response in vivo comprising administering to a mammalian subject an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
major histocompatibility (MHC) targeting sequence fused to a second nucleotide 
sequence encoding a heterologous human HTL peptide epitope. 

In another aspect, the present invention provides a method of assaying the 

20 human immunogenicity of a human T cell peptide epitope in vivo in a non-human 

mammal, comprising the step of administering to the non-human mammal an expression 
vector comprising a promoter operably linked to a first nucleotide sequence encoding a 
heterologous human CTL or HTL peptide epitope. 

In one embodiment, the heterologous peptide epitopes comprise two or 

25 more heterologous HTL peptide epitopes. In another embodiment, the heterologous 
peptide epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 
In another embodiment, the heterologous peptide epitopes further comprise one to two or 
more heterologous CTL peptide epitopes. In another embodiment, the expression vector 
comprises both HTL and CTL peptide epitopes. 

30 In one embodiment, one of the HTL peptide epitopes is a universal HTL 

epitope. In another embodiment, the universal HTL epitope is a pan DR epitope. In 
another embodiment, the pan DR epitope has the sequence 
AlaLysPheValAlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 



7 
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In one embodiment, the peptide epitopes are hepatitis B virus epitopes, 
hepatitis C virus epitopes, himian immunodeficiency virus epitopes, human papilloma 
virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, PAP epitopes, p53 
epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. In another 
5 embodiment, the peptide epitopes each have a sequence selected from the group 

consisting of the peptides depicted in Tables 1-8. In another embodiment, at least one of 
the peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

In one embodiment, the MHC targeting sequence comprises a region of a 
polypeptide selected from the group consisting of the li protein, LAMP -I, HLS-DM, 
1 0 HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface antigen, hepatitis B virus 
core antigen, Ty panicle, Ig-a protein, Ig-P protein, and Ig kappa chain signal sequence. 

In one embodiment, the expression vector further comprises a second 
promoter sequence operably linuked to a third nucleotide sequence encoding one or more 
heterologous HTL or CTL peptide epitopes. In another embodiment, the CTL peptide 
1 5 epitope comprises a structural motif for an HL A supertype, whereby the peptide CTL 
epitope binds to two or more members of the supertype with an affinity of greater that 
500 nM. In another embodiment, the CTL peptide epitopes have structural motifs that 
provide binding affinity for more than one HLA allele supertype. 

In one embodiment, the non-human mammal is a transgenic mouse that 
20 expresses a human HLA allele. In another embodiment, the human HLA allele is selected 
from the group consisting of All and A2.1 . In another embodiment, the non-human 
mammal is a macaque that expresses a human HLA allele. 



BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 shows the nucleotide and amino acid sequences (SEQ ID N0S:1 

and 2, respectively) of the liPADRE construct encoding a flision of the murine li gene 
with a pan DR epitope sequence substituted for the CLIP sequence of the li protein. 

Figure 2 shows the nucleotide and amino acid sequences (SEQ ID N0S:3 
and 4, respectively) of the I80T construct encoding a fiasion of the cytoplasmic domain, 
30 the transmembrane domain and part of the luminal domain of the li protein fused to 
multiple MHC class II epitopes. 

Fig\u:e 3 shows the nucleotide and ammo acid sequences (SEQ ID N0S:5 
and 6, respectively) of the liThfiiU construct encoding a fusion of the cytoplasmic 
domain, transmembrane domain and a portion of the luminal domain of the li protein 
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fused to multiple T helper epitopes and amino acid residues 101 to 215 of the li protein, 
which encodes the trimerization region of the li protein. 

Figure 4 shows the nucleotide and amino acid sequences (SEQ ID NOS:7 
and 8, respectively) of the KappaLAMP-Th construct encoding a fusion of the murine 
5 immunoglobulin kappa signal sequence fused to multiple T helper epitopes and the 
transmembrane and cytoplasmic domains of LAMP-1. 

Figure 5 shows the nucleotide and amino acid sequences (SEQ ID N0S:9 
and 10, respectively) of the H2M-Th construct encoding a fusion of the signal sequence 
of H2-M fused to multiple MHC class II epitopes and the transmembrane and 
10 cytoplasmic domains of H2-M. 

Figure 6 shows the nucleotide and amino acid sequences (SEQ ID N0S:1 1 
and 12, respectively) of the H20-Th construct encoding a fusion of the signal sequence of 
H2-DO fused to multiple MHC class n epitopes and the transmembrane and cytoplasmic 
domains of H2-D0. 

15 Figure 7 shows the nucleotide and amino acid sequences (SEQ ED NOS:13 

and 14, respectively) of the PADRE-Influenza matrix construct encoding a fusion of a 
pan DR epitope sequence fused to the amino-terminus of influenza matrix protein. 

Figure 8 shows the nucleotide and amino acid sequences (SEQ ID N0S:15 
and 16, respectively) of the PADRE-HBV-s construct encoding a fusion of a pan DR 
20 epitope sequence fused to the amino-terrainus of hepatitis B virus surface antigen. 

Figure 9 shows the nucleotide and amino acid sequences (SEQ ID NOS:17 
and 18, respectively) of the Ig-alphaTh construct encoding a fusion of the signal sequence 
of the Ig-a protein fused to multiple MHC class II epitopes and the transmembrane and 
cytoplasmic domains of the Ig-a protein. 
25 Figure 10 shows the nucleotide and amino acid sequences (SEQ ID 

N0S:19 and 20, respectively) of the Ig-betaTh construct encoding a fusion of the signal 
sequence of the Ig-P protein ftised to multiple MHC class 11 epitopes and the 
transmembrane and cytoplasmic domains of the Ig-p protein. 

Figure 1 1 shows the nucleotide and amino acid sequences (SEQ ID 
30 N0S:21 and 22, respectively) of the SigTh construct encoding a fusion of the signal 
sequence of the kappa immxmoglobulin fused to multiple MHC class 11 epitopes. 

Figure 12 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:23 and 24, respectively) of human HLA-DR, the invariant chain (li) protein. 
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Figure 13 shows the nucleotide and amino acid sequences (SEQ ID 
NOS:25 and 26, respectively) of human lysosomal membrane glycoprotein- 1 (LAMP-1). 

Figure 14 shows the nucleotide and amino acid sequences (SEQ ED 
NOS:27 and 28, respectively) of human HLA-DMB. 
5 Figure 1 5 shows the nucleotide and amino acid sequences (SEQ ID 

NOS:29 and 30, respectively) of human HLA-DO beta. 

Figure 16 shows the nucleotide and amino acid sequences (SEQ ID 
NOS;3 1 and 32, respectively) of the human MB-1 Ig-a. 

Figure 17 shows the nucleotide and amino acid sequences (SEQ ID 
10 NOS:33 and 34, respectively) of human Ig-P protein. 

Figure 18 shows a schematic diagram depicting the method of generating 
some of the constructs encoding a MHC class 11 targeting sequence fused to multiple 
MHC class II epitopes. 

Figure 19 shows the nucleotide sequence of the vector pEP2 (SEQ ID 

15 NO:35). 

Figure 20 shows the nucleotide sequence of the vector pMIN.O (SEQ ID 

NO:36). 

Figure 21 shows the nucleotide sequence of the vector pMIN.l (SEQ ID 

NO:37). 

20 Figure 22. Representative CTL responses in HLA-A2. l/K^-H-2''''' mice 

immunized with pMin.l DNA. Splenoc5^es from primed animals were cultured in 
triplicate flasks and stimulated twice in vitro with each peptide epitope. Cytotoxicity of 
each culture was assayed in a ^'Cr release assay against Jurkat-A2.1/K'' target cells in the 
presence (filled symbols, soUd lines) or absence (open symbols, dotted lines) of peptide. 

25 Each symbol represents the response of a single culture. 

Figure 23. Presentation of viral epitopes to specific CTLs by Jurkat- 
A2. 1/K'' tumor cells transfected with DNA minigene. Two constructs were used for 
transfection,.pMin.l and pMin.2-GFP. pMin.2-GFP-transfected targets cells were sorted 
by FACS and the population used in this experiment contained 60% fluorescent cells. 

30 CTL stimulation was measured by quantitating the amount of IFN-y release (A, B) or by 
lysis of ^'Cr-Iabeled target cells (C, D, hatched bars). CTLs were stimulated with 
transfected cells (A, C) or with parental Jurkat-A2,l/K'' cells in the presence of I |ig/ml 
peptide (B, D). Levels of IFN- y release and cytotoxicity for the different CTL lines in 
the absence of epitope ranged from 72-126 pg/ml and 2-6% respectively. 
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Figure 24. Summary of modified minigene constructs used to address 
variables critical for in vivo immunogenicity. The following modifications were 
incorporated into the prototype pMin.l construct; A, deletion of PADRE HTL epitope; B, 
incorporation of the native HBV Pol 551 epitope that contains an alanine in position 9; C, 
5 deletion of the Ig kappa signal sequence; and D, switching position of the HBV Env 335 
and HBV Pol 455 epitopes. 

Figure 25. Examination of variables that may influence pMin.l 
immunogenicity. In vivo CTL-inducing activity of pMin.l is compared to modified 
constructs. For ease of comparison, the CTL response induced fay each of the modified 

1 0 DNA minigene constructs (shaded bars) is compared separately in each of the four panels 
to the response induced by the prototype pMin.l construct (solid bars). The geometric 
mean response of CTL-positive cultures from two to five independent experiments are 
shown. Numbers shown with each bar indicate the number of positive cultures/total 
number tested for that particular epitope. The ratio of positive cultures/total tested for the 

15 pMin.l group is shown in panel A and is the same for the remaining Figure panels (see 
Example V, Materials and Methods, in vitro CTL cultures, for the definition of a positive 
CTL culture). Theradigm responses were obtained by immunizing animals with the 
lipopeptide and stimulating and testing splenocyte cultures with the HBV Core 18-27 
peptide. 

20 

DEFINITIONS 

An "HTL" peptide epitopeor an "MHC II epitope" is an MHC class II 
restricted epitope, i.e., one that is bound by an MHC class II molecule. 

A "CTL" jpeptide epitope or an "MHC I epitope" is an MHC class I 
25 restricted epitope, i.e., one that is bound by an MHC class I molecule. 

An "MHC targeting sequence" refers to a peptide sequence that targets a 
polypeptide, e.g., comprising a peptide epitope, to a cytosolic pathway (e.g., an MHC 
class I antigen processing pathway), en endoplasmic reticulum pathwasy, or an endocytic 
pathway (e.g., an MHC class II antigen processing pathway). 
30 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences firom unrelated genes 
arranged to make a new fimctional nucleic acid, e.g., a promoter from one source and a 
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coding region from another source. Similarly, a heterologous protein indicates that the 
protein comprises two or more subsequences that are not found in the same relationship to 
each other in nature, e.g., a fusion polypeptide comprising subsequence from different 
polypeptides, peptide epitopes from the same polypeptide that are not naturally in an 
5 adjacent position, or repeats of a single peptide epitope. 

As used herein, the term "universal MHC class II epitope" or a "vmiversal 
HTL epitope" refers to a MHC class II peptide epitope that binds to gene products of 
multiple MHC class 11 alleles. For example, the DR, DP and DQ alleles are human MHC 
II alleles. Generally, a unique set of peptides binds to a particular gene product of a MHC 

10 class II allele. In contrast, a universal MHC class II epitope is able to bind to gene 

products of multiple MHC class II alleles. A universal MHC class II epitope binds to 2 or 
more MHC class II alleles, generally 3 or more MHC class II alleles, and particularly 5 or 
more MHC class H alleles. Thus, the presence of a universal MHC class 11 epitope in an 
expression vector is advantageous in that it functions to increase the number of allelic 

15 MHC class II molecules that can bind to the peptide and, consequently, the number of 
Helper T lymphocytes that are activated. 

Universal MHC class II epitopes are well known in the art and include, for 
example, epitopes such as the "pan DR epitopes," also referred to as "PADRE" 
(Alexander et al, Immunity 1:751-761 (1994); WO 95/07707, USSN 60/036,713, USSN 

20 60/037,432, PCT,^'S98/0 1 373, 09/009,953, and USSN 60/087, 1 92 each of which is 

incorporated herein by reference). A "pan DR binding peptide" or a "PADRE" peptide of 
the invention is a peptide capable of binding at least about 7 different DR molecules, 
preferably 7 of the 12 most common DR molecules, most preferably 9 of the 12 most 
common DR molecules (DRl, 2w2b, 2w2a, 3, 4w4, 4wH, 5, 7, 52a, 52b, 52c, and 53), or 

25 altematively, 50% of a panel of DR molecules representative of greater than or equal to 
75% of the human population, preferably greater than or equal to 80% of the human 
population. Pan DR epitopes can bind to a number of DR alleles and are strongly 
immunogenic for T cells. For example, pan DR epitopes were foimd to be more effective 
at inducing an immuae response than natural MHC class II epitopes (Alexander, supra). 

30 An example of a PADRE epitope is the peptide 

AlaLysPheValAIaAlaTrpThrLeuLysAlaAlaAla (SEQ ID NO:38) (for additional 
examples of PADRE epitopes, see Table 8 of TTC docket No. 018623-006221, filed May 
12, 1999, USSN _, herein incorporated by reference in its entirety). 
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With regard to a particular amino acid sequence, an "epitope" is a set of 
amino acid residues which is involved in recognition by a particular uimiunoglobulin, or 
in the context of T cells, those residues necessary for recognition by T cell receptor 
proteins and/or Major Histocompatibility Complex (MHC) receptors. In an immune 
5 system setting, in vivo or in vitro, an epitope is the collective features of a molecule, such 
as primary, secondary and tertiary peptide structure, and charge, that together form a site 
recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this 
disclosure epitope and peptide are often used interchangeably. It is to be appreciated, 
however, that isolated or purified protein or peptide molecules larger than and comprising 

10 an epitope of the invention are still within the bounds of the invention. 

As used herein, "high affinity" with respect to HLA class I molecules is 
defined as binding with an IC50 (or Kd) of less than 50 nM. "Intermediate affinity" is 
binding with an IC50 (or Kd) of between about 50 and about 500 nM. "High affinity" 
with respect to binding to HLA class II molecules is defined as binding with an Kd of 

15 less than 100 nM. "Intermediate affinity" is binding with a Kd of between about 100 and 
about 1000 nM. Assays for determining binding are described in detail, e.g., in PCT 
publications WO 94/20127 and WO 94/03205. Alternatively, bindmg is expressed 
relative to a reference peptide. As a particular assay becomes more, or less, sensitive, the 
IC50s of the peptides tested may change somewhat. However, the binding relative to the 

20 reference peptide will not significantly change. For example, in an assay run under 

conditions such that the IC50 of the reference peptide increases lO-fold, the IC50 values 
of the test peptides will also shift approximately 10-foId. Therefore, to avoid ambiguities, 
the assessment of whether a peptide is a good, intermediate, weak, or negative binder is 
generally based on its IC50, relative to the IC50 of a standard peptide. 

25 Throughout this disclosure, results are expressed in terms of "IC50s." 

IC50 is the concentration of peptide in a binding assay at which 50% inhibition of binding 
of a reference peptide is observed. Given the conditions in which the assays are run (i.e., 
limiting HLA proteins and labeled peptide concentrations), these values approximate KD 
values. It should be noted that IC50 values can change, often dramatically, if the assay 

30 conditions are varied, and depending on the particular reagents used (e.g., HLA 

preparation, etc). For example, excessive concentrations of HLA molecules will increase 
the apparent measured IC50 of a given ligand. 

The terms "identical" or percent "identity," in the context of two or more 
peptide sequences, refer to two or more sequences or subsequences that are the same or 
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have a specified percentage of amino acid residues that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measured using a 
sequence comparison algorithms using default program parameters or by manual 
alignment and visual inspection. 
5 The phrases "isolated" or "biologically pure" refer to material which is 

substantially or essentially free from components which normally accompany the material 
as it is found in its native state. Thus, isolated peptides in accordance with the invention 
preferably do not contain materials normally associated with the peptides in their in situ 
environment. 

1 0 "Major histocompatibility complex" or "MHC" is a cluster of genes that 

plays a role in control of the cellular interactions responsible for physiologic immune 
responses. In humans, the MHC complex is also known as the HLA complex. For a 
detailed description of the MHC and HLA complexes, see Paul, Fundamental 
Immunology (3rd ed. 1993). 

1 5 "Human leukocyte antigen" or "HLA" is a human class I or class II major 

histocompatibility complex (MHC) protein (see, e.g., Stites, et aL, Immunology, (8th ed., 
1994). 

An "HLA supertype or family", as used herein, describes sets of HLA 
molecules grouped on the basis of shared peptide-binding specificities. HLA class I 

20 molecules that share somewhat similar binding affinity for peptides bearing certain amino 
acid motifs are grouped into HLA supertypes. The terms HLA superfamily, HLA 
supertype family, HLA family, and HLA xx-like supertype molecules (where xx denotes 
a particular HLA type), are synon)mis. 

The term "motif refers to the pattem of residues in a peptide of defined 

25 length, usually a peptide of from about 8 to about 13 amino acids for a class I HLA motif 
and from about 6 to about 25 amino acids for a class 11 HLA motif, which is recognized 
by a particular HLA molecule. Peptide motifs are typically different for each protein 
encoded by each human HLA allele and differ in the pattem of the primary and secondary 
anchor residues. 

30 A "supermotif is a peptide binding specificity shared by HLA molecules 

encoded by two or more HLA alleles. Thus, a preferably is recognized with high or 
intermediate affinitv' (as defmed herein) by two or more BDLA antigens. 

"Cross-reactive binding" indicates that a peptide is bound by more than 
one HLA molecule; a synonym is degenerate binding. 
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The terra "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other, typically by peptide bonds between the a-amino and carboxyl groups of 
adjacent amino acids. The preferred CTL-inducing oligopeptides of the invention are 13 
5 residues or less in length and usually consist of between about 8 and about 1 1 residues, 
preferably 9 or 10 residues. The preferred HTL-inducing oligopeptides are less than 
about 50 residues in length and usually consist of between about 6 and about 30 residues, 
more usually between about 12 and 25, and often between about 15 and 20 residues. 

An "immunogenic peptide" or "peptide epitope" is a peptide which 
1 0 comprises an allele-specific motif or supermotif such that the peptide will bind an HLA 
molecule and induce a CTL and/or HTL response. Thus, immunogenic peptides of the 
invention are capable of binding to an appropriate HLA molecule and thereafter inducing 
a cytotoxic T cell response, or a helper T cell response, to the antigen from which the 
immunogenic peptide is derived. 
15 A "protective immune response" refers to a CTL and/or an HTL response 

to an antigen derived from an infectious agent or a tumor antigen, which prevents or at 
least partially arrests disease symptoms or progression. The immune response may also 
include an antibody response which has been facilitated by the stimulation of helper T 
cells. 

20 The term "residue" refers to an amino acid or amino acid mimetic 

incorporated into an oligopeptide by an amide bond or amide bond mimetic. 

"Sv-nthetic peptide" refers to a peptide that is not naturally occurring, but is 
man-made using such methods as chemical S3mthesis or recombinant DNA technology. 

The nomenclature used to describe peptide compounds follows the 

25 conventional practice wherein the amino group is presented to the left (the N-terminus) 
and the carboxyl group to the right {the C-terminus) of each amino acid residue. When 
amino acid residue positions are referred to in a peptide epitope they are numbered in an 
amino to carboxyl direction with position one being the position closest to the amino 
terminal end of the epitope, or the peptide or protein of which it may be a part. In the 

30 formulae representmg selected specific embodiments of the present invention, the amino- 
and carboxyl-terminal groups, although not specifically shown, are in the form they 
would assume at physiologic pH values, unless otherwise specified. In the amino acid 
structure formulae, each residue is generally represented by standard three letter or single 
letter designations. The L-form of an amino acid residue is represented by a capital single 
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letter or a capital first letter of a three-letter symbol, and the D-fonii for those amino acids 
having D-forms is represented by a lower case single letter or a lower case three letter 
symbol. Glycine has no asymmetric carbon atom and is simply referred to as "Gly" or G. 

As used herein, the term "expression vector" is intended to refer to a 
5 nucleic acid molecule capable of expressing an antigen of interest such as a MHC class I 
or class 11 epitope in an appropriate target cell. An expression vector can be, for example, 
a plasmid or virus, including DNA or RNA viruses. The expression vector contains such 
a promoter element to express an antigen of interest in the appropriate cell or tissue in 
order to stimulate a desired immune response. 

10 

DETAILED DESCRIPTION OF THE INVENTION 
Cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs) are 
critical for immunir.' against infectious pathogens; such as viruses, bacteria, and protozoa; 
tumor cells; autoimmunne diseases and the like. The present invention provides 

15 minigenes that encode peptide epitopes which induce a CTL and/or HTL response. The 
minigenes of the invention also include an MHC targeting sequence. A variety of 
minigenes encoding different epitopes can be tested for immunogenicity using an HLA 
transgenic mouse. The epitopes are typically a combination of at least two or more HTL 
epitopes, or a CTL epitope plus a universal HTL epitope, and optinally include additional 

20 HTl and/or CTL epitopes. Two, three, four, five, six, seven, eight, nine, ten, twenty, 
thirty, forty or about fifty different epitopes, either HTL and/or CTL, can be included in 
the minigene, along with the MHC targeting sequence. The epitopes can have different 
HLA restriction. Epitopes to be tested include those derived from viruses such as HIV, 
HBV, HCV, HSV, CMV, HPV, and HTLV; cancer antigens such as p53, Her2/Neu, 

25 MAGE, PSA, hioman papilloma virus, and CEA; parasites such as Trypanosoma, 

Plasmodium, Leishmania, Giardia, Entamoeba; autoimmune diseases such as rheumatoid 
arthritis, myesthenia gravis, and lupus erythematosus; fungi such as Aspergillus and 
Candida; and bacteria such as Escherichia coli, Staphylococci, Chlamydia, Mycobacteria, 
Streptococci, and Pseudomonas. The epitopes to be encoded by the minigene are selected 

30 and tested using the methods described in pubUshed PCT applications WO 93/0742 1 , WO 
94/02353, WO 95/01000, WO 97/04451, and WO 97/05348, herein incorporated by 
reference. 



- 16 - 



wo 99/58658 



PCT/US99/10646 



HTL and CTL Epitopes 

The expression vectors of the invention encode one or more MHC class II 
and/or class I epitopes and an MHC targeting sequence. Multiple MHC class 11 or class I 
epitopes present in an expression vector can be derived from the same antigen, or the 
5 MHC epitopes can be derived from different antigens. For example, an expression vector 
can contain one or more MHC epitopes that can be derived from two different antigens of 
the same virus or from two different antigens of different viruses. Furthermore, any 
MHC epitope can be used in the expression vectors of the invention. For example, any 
single MHC epitope or a combination of the MHC epitopes shown in Tables 1 to 8 can be 

10 used in the expression vectors of the invention. Other peptide epitopes can be selected by 
one of skill in the art, e.g., by using a computer to select epitopes that contain HLA allele- 
specific motifs or supermotifs. The expression vectors of the invention can also encode 
one or more universal MHC class II epitopes, e.g., PADRE {see, e.g., SEQ ID NO:38 and 
Table 8 of TTC docket No. 018623-006221, filed May 12, 1999, USSN 

15 ). 

Universal MHC class 11 epitopes can be advantageously combined with 
other MHC class I and class II epitopes to increase the number of cells that are activated 
in response to a given antigen and provide broader population coverage of MHC-reactive 
alleles. Thus, the expression vectors of the invention can encode MHC epitopes specific 

20 for an antigen, universal MHC class II epitopes, or a combination of specific MHC 
epitopes and at least one universal MHC class II epitope. 

MHC class I epitopes are generally about 5 to 15 amino acids in length, in 
particular about 8 to 11 amino acids in length. MHC class II epitopes are generally about 
10 to 25 amino acids in length, in particular about 13 to 21 amino acids in length. A 

25 MHC class I or II epitope can be derived from any desired antigen of interest. The 
antigen of interest can be a viral antigen, surface receptor, tumor antigen, oncogene, 
enzyme, or any pathogen, cell or molecule for which an immune response is desired. 
Epitopes can be selected based on their ability to bind one or multiple HLA alleles, and 
can also be selected using the "analog" technique described below. 

30 

Targeting Sequences 

The expression vectors of the invention encode one or more MHC epitopes 
operably linked to a MHC targeting sequence. The use of a MHC targeting sequence 
enhances the immune response to an antigen, relative to delivery of antigen alone, by 
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directing the peptide epitope to the site of MHC molecule assembly and transport to the 
cell surface, thereby providing an increased number of MHC molecule-peptide epitope 
complexes available for binding to and activation of T cells. 

MHC class I targeting sequences are used in the present invention, e.g., 
5 those sequences that target an MHC class I epitope peptide to a cytosolic pathway or to 
the endoplasmic reticulum {see, e.g., Rammensee et al, Immunogenetics 41:178-228 
(1995)). For example, the cytosolic pathway processes endogenous antigens that are 
expressed inside the cell. Although not wishing to be bound by any particular theory, 
cytosolic proteins are thought to be at least partially degraded by an endopeptidase 

10 activity of a proteasome and then transported to the endoplasmic reticulum by the TAP 
molecule (transporter associated with processing). In the endoplasmic reticulum, the 
antigen binds to MHC class I molecules. Endoplasmic reticulum signal sequences bypass 
the cytosolic processing pathway and directly target endogenous antigens to the 
endoplasmic reticulum, where proteol3d;ic degradation into peptide fragments occurs. 

15 Such MHC class I targeting sequences are well known in the art, and include, e.g., signal 
sequences such as those from Ig kappa ,tissue plasminogen activator or insulin. A 
preferred signal peptide is the human Ig kappa chain sequence. Endoplasmic reticulum 
signal sequences can also be used to target MHC class 11 epitopes to the endoplasmic 
reticulum, the site of MHC class I molecule assembly. 

20 MHC class 11 targeting sequences are also used in the invention, e.g., those 

that target a peptide to the endocytic pathway. These targeting sequences typically direct 
extracellular antigens to enter the endocytic pathway, which results in the antigen being 
transferred to the lysosomal compartment where the antigen is proteolytically cleaved 
into antigen peptides for binding to MHC class II molecules. As with the normal 

25 processing of exogenous antigen, a sequence that directs a MHC class II epitope to the 
endosomes of the endocytic pathway and/or subsequently to lysosomes, where the MHC 
class n epitope can bind to a MHC class II molecule, is a MHC class 11 targeting 
sequence. For example, group of MHC class 11 targeting sequences useful in the 
invention are lysosomal targeting sequences, which localize polypeptides to lysosomes. 

30 Since MHC class n molecules typically bind to antigen peptides derived from proteolytic 
processing of endocytosed antigens in lysosomes, a lysosomal targeting sequence can 
frmction as a MHC class II targeting sequence. Lysosomal targeting sequences are well 
known in the art and include sequences found in the lysosomal proteins LAMP-1 and 
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LAMP-2 as described by August et al. (U.S. Patent No. 5,633,234, issued May 27, 1997), 
which is incorporated herein by reference. 

Other lysosomal proteins that contain lysosomal targeting sequences 
include HLA-DM. HLA-DM is an endosomal/lysosomal protein that functions in 
5 facilitating binding of antigen peptides to MHC class II molecules. Since it is located in 
the lysosome, HLA-DM has a lysosomal targeting sequence that can function as a MHC 
class II molecule targeting sequence (Copier et al, J. Immunol. 157:1017-1027 (1996), 
which is incorporated herein by reference). 

The resident lysosomal protein HLA-DO can also function as a lysosomal 

10 targeting sequence. In contrast to the above described resident lysosomal proteins 

LAMP-1 and HLA-DM, which encode specific Tyr-containing motifs that target proteins 
to lysosomes, HLA-DO is targeted to lysosomes by association with HLA-DM (Liljedahl 
et al, EMBO J. 15:4817-4824 (1996)), which is incorporated herein by reference. 
Therefore, the sequences of HLA-DO that cause association with HLA-DM and, 

15 consequently, translocation of HLA-DO to lysosomes can be used as MHC class II 

targeting sequences. Similarly, the murine homolog of HLA-DO, H2-D0, can be used to 
derive a MHC class II targeting sequence. A MHC class II epitope can be fused to HLA- 
DO or H2-D0 and targeted to lysosomes. 

In another example, the cytoplasmic domains of B cell receptor subunits 

20 Ig-a and Ig-P mediate antigen intemalization and increase the efficiency of antigen 

presentation (Bonnerot et al. Immunity 2:335-347 (1995)), which is incorporated herein 
by reference, Therefore, the cytoplasmic domains of the Ig-a and Ig-p proteins can 
function as MHC class II targeting sequences that target a MHC class 11 epitope to the 
endocytic pathway for processing and binding to MHC class II molecules. 

25 Another example of a MHC class II targeting sequence that directs MHC 

class II epitopes to the endoc5^ic pathway is a sequence that directs polypeptides to be 
secreted, where the polypeptide can enter the endosomal pathway. These MHC class II 
targeting sequences that direct polypeptides to be secreted mimic the normal pathway by 
which exogenous, extracellular antigens are processed into peptides that bind to MHC 

30 class II molecules. Any signal sequence that fimctions to direct a polypeptide through the 
endoplasmic reticulum and ultimately to be secreted can function as a MHC class II 
targeting sequence so long as the secreted polypeptide can enter the endosomal/lysosomal 
pathway and be cleaved into peptides that can bind to MHC class II molecules. An 
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example of such a fusion is shown in Figure 11, where the signal sequence of kappa 
immunoglobulin is fused to multiple MHC class II epitopes. 

In another example, the li protein binds to MHC class II molecules in the 
endoplasmic reticulum, where it functions to prevent peptides present in the endoplasmic 
5 reticulum from binding to the MHC class 11 molecules. Therefore, fusion of a MHC class 
II epitope to the li protein targets the MHC class II epitope to the endoplasmic reticulum 
and a MHC class II molecule. For example, the CLIP sequence of the li protein can be 
removed and replaced with a MHC class n epitope sequence so that the MHC class II 
epitope is directed to the endoplasmic reticulum, where the epitope binds to a MHC class 
10 II molecule. 

In some cases, antigens themselves can serve as MHC class II or I 
targeting sequences and can be fused to a universal MHC class II epitope to stimulate an 
immune response. Although cj^oplasmic viral antigens are generally processed and 
presented as complexes with MHC class I molecules, long-lived cytoplasmic proteins 

15 such as the influenza matrix protein can enter the MHC class II molecule processing 

pathway (Gueguen & Long, Proc. Natl. Acad. Sci. USA 93:14692-14697 (1996)), which 
is incorporated herein by reference. Therefore, long-lived cytoplasmic proteins can 
function as a MHC class II targeting sequence. For example, an expression vector 
encoding influenza matrix protein fused to a universal MHC class II epitope can be 

20 advantageously used to target influenza antigen and the universal MHC class II epitope to 
the MHC class II pathway for stimulating an immune response to influenza. 

Other examples of antigens functioning as MHC class 11 targeting 
sequences include polypeptides that spontaneously form particles. The polypeptides are 
secreted from the cell that produces them and spontaneously form particles, which are 

25 taken up into an antigen-presenting cell by endocytosis such as receptor-mediated 

endocytosis or are engulfed by phagocytosis. The particles are proteolytically cleaved 
into antigen peptides after entering the endosomal/lysosomal pathway. 

One such polypeptide that spontaneously forms particles is HBV surface 
antigen (HBV-S) piminsky et al., Vaccine 15:637-647 (1997); Le Borgne et al, 

30 Virology 240:304-3 15 (1998)), each of which is incorporated herein by reference. 

Another polypeptide that spontaneously forms particles is HBV core antigen (Kuhrober et 
al, International Immunol 9:1203-1212 (1997)), which is incorporated herein by 
reference. Still another polypeptide that spontaneously forms particles is the yeast Ty 
protein (Weber et al. Vaccine 13:831-834 (1995)), which is incorporated herein by 
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reference. For example, an expression vector containing HBV-S antigen fused to a 
universal MHC class n epitope can be advantageously used to target HBV-S antigen and 
the universal MHC class 11 epitope to the MHC class II pathway for stimulating an 
immune response to HBV. 

5 

Blading Affinity of Peptide Epitopes for HLA Molecules 

The large degree of HLA polymorphism is an important factor to be taken 
into accottnt with the epitope-based approach to vaccine development. To address this 
factor, epitope selection encompassing identification of peptides capable of binding at 

10 high or intermediate affinity to multiple HLA molecules is preferably utilized, most 
preferably these epitopes bind at high or intermediate affinity to two or more allele 
specific HLA molecules. 

CTL -inducing peptides of interest for vaccine compositions preferably 
include those that have a binding affinity for class I HLA molecules of less than 500 nM. 

15 HTL-inducing peptides preferably include those that have a binding affinity for class II 
HLA molecules of less than 1000 nM. For example, peptide binding is assessed by 
testing the capacity of a candidate peptide to bind to a purified HLA molecule in vitro. 
Peptides exhibiting high or intermediate affinity are then considered for further analysis. 
Selected peptides are tested on other members of the supertype family. In preferred 

20 embodiments, peptides that exhibit cross-reactive binding are then used in vaccines or in 
cellular screening analyses. 

Higher HLA binding affinity is typically correlated with greater 
immunogenicity. Greater immunogenicity can be manifested in several different ways. 
Immunogenicity corresponds to whether an immune response is elicited at all, and to the 

25 vigor of any particular response, as well as to the extent of a population in which a 

response is eUcited. For example, a peptide might elicit an immune response in a diverse 
array of the population, yet in no instance produce a vigorous response. In accordance 
with these principles, close to 90% of high binding peptides have been found to be 
immunogenic, as contrasted with about 50% of the peptides which bind with intermediate 

30 affmity. Moreover, higher binding affinity peptides leads to more vigorous immunogenic 
responses. As a result, less peptide is required to elicit a similar biological effect if a high 
affinity binding peptide is used. Thus, in preferred embodiments of the invention, high 
binding epitopes are particularly usefiiL 
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The relationship between binding affinity for HLA class I molecules and 
immunogenicity of discrete peptide epitopes on bound antigens has been determined for 
the first time in the art by the present inventors. The correlation between binding affinity 
and immunogenicity was analyzed in two different experimental approaches (Sette et at., 
5 J. Immunol. 153:5586-5592 (1994)). In the first approach, the immunogenicity of 

potential epitopes ranging in HLA binding affinity over a 10,000-fold range was analyzed 
in HLA-A*0201 transgenic mice, hi the second approach, the antigenicity of 
approximately 100 different hepatitis B virus (HBV)-derived potential epitopes, all 
carrying A*0201 binding motifs, was assessed by using PBL (peripheral blood 

10 lymphocytes) from acute hepatitis patients. Pursuant to these approaches, it was 

determined that an affinity threshold of approximately 500 nM (preferably 50 nM or less) 
determines the capacity of a peptide epitope to elicit a CTL response. These data are true 
for class I binding affinity measurements for naturally processed peptides and for 
synthesized T cell epitopes. These data also indicate the important role of determinant 

15 selection in the shaping of T cell responses (see. e.g., Schaeffer et al. Proc. Natl. Acad. 
Set USA 86:4649-4653, 1989). 

An affinity threshold associated with immunogenicity in the context of 
HLA class II DR molecules has also been delineated {see, e.g., Southwood et al. J. 
Immunology 160:3363-3373 (1998), and USSN 60/087192, filed 5/29/98). In order to 

20 define a biologically significant threshold of DR binding affinity, a database of the 

binding affinities of 32 DR-restricted epitopes for their restricting element (i.e., the HLA 
molecule that binds the motif) was compiled. In approximately half of the cases (15 of 32 
epitopes), DR restriction was associated with high binding affinities, i.e. binding affinities 
of less than 100 nM. hi the other half of the cases (16 of 32), DR restriction was 

25 associated with intermediate affinity (binding affinities in the 100-1000 nM range). In 
only one of 32 cases was DR restriction associated with an IC50 of 1000 nM or greater. 
Thus, 1000 nM can be defined as an affinity threshold associated with immunogenicity in 
the context of DR molecules. 

30 Peptide Epitope Binding Motifs and Supermotifs 

In the past few years evidence has acciraiulated to demonstrate that a large 
firaction of HLA class I and class II molecules can be classified into a relatively few 
supertypes, each characterized by largely overlapping peptide binding repertoires, and 
consensus structures of the main peptide binding pockets. 
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For HLA molecule pocket analyses, the residues comprising the B and F 
pockets of HLA class I molecules as described in crystallographic studies were analyzed 
(Guo et al. Nature 360:364 (1992); Saper et al, J. Mol Biol. 219:277 (1991); Madden et 
al, Ce// 75:693 (1993); Parham etal, Immunol. Rev. 143:141 (1995)). In these analyses, 
5 residues 9, 45, 63, 66, 67, 70, and 99 were considered to make up the B pocket; and the B 
pocket was deemed to determine the specificity for the amino acid residue in the second 
position of peptide ligands. Similarly, residues 77, 80, 8 1 , and 116 were considered to 
determine the specificity of the F pocket; the F pocket was deemed to determine the 
specificity for the C-terminal residue of a peptide ligand bound by the HLA class I 
10 molecule. 

Through the study of single amino acid substituted antigen analogs and the 
sequencing of endogenously bound, naturally processed peptides, critical residues 
required for allele-specific binding to HLA molecules have been identified. The presence 
of these residues correlates with binding affinity for HLA molecules. The identification 

15 of motifs and/or supermotifs that correlate with high and intermediate affinity binding is 
an important issue with respect to the identification of immunogenic peptide epitopes for 
the inclusion in a vaccine. Kast et al. (J. Immunol. 152:3904-3912 (1994)) have shown 
that motif-bearing peptides account for 90% of the epitopes that bind to allele-specific 
HLA class I molecules. In this study all possible peptides of 9 amino acids in length and 

20 overlapping by eight amino acids (240 peptides), which cover the entire sequence of the 
E6 and E7 proteins of human papillomavirus type 16, were evaluated for binding to five 
allele-specific HLA molecules that are expressed at high firequency among different 
ethnic groups. This unbiased set of peptides allowed an evaluation of the predictive value 
of HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that 

25 bound to an allele-specific HLA molecules with high or intermediate affinity. Of these 
22 peptides, 20, (i.e., 91%), were motif-bearing. Thus, this study demonstrates the value 
of motifs for the identification of peptide epitopes for inclusion in a vaccine: application 
of motif-based identification techniques eliminates screening of 90% of the potential 
epitopes in a target antigen protein sequence. 

30 Peptides of the present invention may also include epitopes that bind to 

MHC class II DR molecules. There is a significant difference between class I and class 11 
HLA molecules. This difference corresponds to the fact that, although a stringent size 
restriction and motif position relative to the binding pocket exists for peptides that bind to 
class I molecules, a greater degree of heterogeneity in both size and binding firame 
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position of the motif, relative to the N and C termini of the peptide, exists for class II 
peptide ligands. 

This increased heterogeneity of HLA class II peptide ligands is due to the 
structure of the binding groove of the HLA class 11 molecule which, unlike its class I 
5 counterpart, is open at both ends. Crystallographic analysis of HLA class II DRB*0101- 
peptide complexes showed that the residues occupying position 1 and position 6 of 
peptides complexed with DRB*0101 engage two complementary pockets on the 
DRBa*0101 molecules, with the PI position corresponding to the most crucial anchor 
residue and the deepest hydrophobic pocket {see, e.g.. Madden, Ann. Rev. Immunol. 

10 13:587 (1995)). Other studies have also pointed to the P6 position as a crucial anchor 
residue for binding to various other DR molecules. 

Thus, peptides of the present invention are identified by any one of several 
HLA class I or II -specific amino acid motifs {see, e.g.. Tables I-III of USSN 09/226,775, 
and 09/239,043, herein incorporated by reference in their entirety). If the presence of the 

15 motif corresponds to the ability to bind several allele-specific HLA antigens it is referred 
to as a supermotif The allele-specific HLA molecules that bind to peptides that possess a 
particular amino acid supermotif are collectively referred to as an HLA "supertype." 

Immune Response-Stimulating Peptide Analogs 

20 In general, CTL and HTL responses are not directed against all possible 

epitopes. Rather, they are restricted to a few "immunodominant" determinants 
(Zinkemagel et al, Adv. Immunol. 27:5159 (1979); Bennink et al, J. Exp. Med. 
168:1935-1939 (1988); Rawle et al, J. Immunol. 146:3977-3984 (1991)). It has been 
recognized that imtnunodominance (Benacerraf et al. Science 175:273-279 (1972)) could 

25 be explained by either the ability of a given epitope to selectively bind a particular HLA 
protein (determinant selection theory) (Vitiello et al, J. Immunol. 131:1635 (1983)); 
Rosenthal et al, Nature 267:156-158 (1977)), or being selectively recognized by the 
existing TCR (T cell receptor) specificity (repertoire theory) (Klein, Immunology, The 
Science of Self on self Discrimination, pp. 270-310 (1982)). It has been demonstrated that 

30 additional factors, mostly linked to processing events, can also play a key role in 

dictating, beyond strict immunogenicity, which of the many potential determinants will 
be presented as immimodominant (Sercarz et al, Annu. Rev. Immunol 11:729-766 
(1993)). 
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The concept of dominance and subdominance is relevant to 
immunotherapy of both infectious diseases and cancer. For example, in the course of 
chronic viral disease, recruitment of subdominant epitopes can be important for 
successful clearance of the infection, especially if dominant CTL or HTL specificities 
5 have been inactivated by functional tolerance, suppression, mutation of viruses and other 
mechanisms (Franco et al, Curr. Opin. Immunol. 7:524-53 1 (1995)). In the case of 
cancer and tumor antigens, CTLs recognizing at least some of the highest binding affinity 
peptides might be functionally inactivated. Lower binding affinity peptides are 
preferentially recognized at these times, and may therefore be preferred in therapeutic or 

1 0 prophylactic anti-cancer vaccines. 

In panicular, it has been noted that a significant number of epitopes 
derived from known non-viral tumor associated antigens (TAA) bind HLA class I with 
intermediate affinity (IC50 in the 50-500 nM range). For example, it has been found that 
8 of 15 known TAA peptides recognized by tumor infiltrating lymphocytes (TIL) or CTL 

15 bound in the 50-500 nM range. (These data are in contrast with estimates that 90% of 
known viral antigens were bound by HLA class I molecules with IC50 of 50 nM or less, 
while only approximately 10% bound in the 50-500 nM range (Sette et al., J. Immunol., 
153:558-5592 (1994)). In the cancer setting this phenomenon is probably due to 
elimination, or functional inhibition of the CTL recognizing several of the highest binding 

20 peptides, presumably because of T cell tolerization events. 

Without intending to be bound by theory, it is believed that because T cells 
to dominant epitopes may have been clonally deleted, selecting subdominant epitopes 
may allow extant T cells to be recruited, which will then lead to a therapeutic or 
prophylactic response. However, the binding of HLA molecules to subdominant epitopes 

25 is often less vigorous than to dominant ones. Accordingly, there is a need to be able to 
modulate the binding affinity of particular immunogenic epitopes for one or more HLA 
molecules, and thereby to modulate the immune response elicited by the peptide, for 
example to prepare analog peptides which elicit a more vigorous response. This ability 
would greatly enhance the usefulness of peptide-based vaccines and therapeutic agents. 

30 Thus, although peptides with suitable cross-reactivity among all alleles of 

a superfamily are identified by the screening procedures described above, cross-reactivity 
is not always as complete as possible, and in certain cases procedures to further increase 
cross-reactivity of peptides can be useful; moreover, such procedures can also be used to 
modify other propenies of the peptides such as binding affinity or peptide stability. 
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Having established the general rules that govern cross-reactivity of peptides for HLA 
alleles within a given motif or supermotif, modification (i.e., analoging) of the structure 
of peptides of particular interest in order to achieve broader (or otherwise modified) HLA 
binding capacity can be performed. More specifically, peptides which exhibit the 
5 broadest cross-reactivity patterns, can be produced in accordance with the teachings 

herein. The present concepts related to analog generation are set forth in greater detail in 
co-pending USSN 09/226,775. 

In brief, the strategy employed utilizes the motifs or supermotifs which 
correlate with binding to certain HLA class I and II molecules. The motifs or supermotifs 

10 are defined by having primary anchors, and in many cases secondary anchors (see Tables 
I-III of USSN 09/226,775). Analog peptides can be created by substituting amino acids 
residues at primary anchor, secondary anchor, or at primary and secondary anchor 
positions. Generally, analogs are made for peptides that already bear a motif or 
supermotif Preferred secondary anchor residues of supermotifs and motifs that have 

1 5 been defined for HLA class I and class II binding peptides are shown in Tables II and HI, 
respectively, of USSN 09/226,775. 

For a number of the motifs or supermotifs in accordance with the 
invention, residues are defined which are deleterious to binding to allele-specific HLA 
molecules or members of HLA supertypes that bind to the respective motif or supermotif 

20 (see Tables 11 and III of USSN 09/226,775). Accordingly, removal of such residues that . 
are detrimental to binding can be performed in accordance with the methods described 
therein. For example, in the case of the A3 supertype, when all peptides that have such 
deleterious residues are removed fi:om the population of analyzed peptides, the incidence 
of cross-reactivity increases from 22% to 37% (I., Sidney et al, Hu. Immunol. 45:79 

25 (1996)). Thus, one strategy to improve the cross-reactivity of peptides within a given 
supermotif is simply to delete one or more of the deleterious residues present within a 
peptide and substitute a small "neutral" residue such as Ala (that may not influence T cell 
recognition of the peptide). An enhanced likelihood of cross-reactivity is expected if, 
together with elimination of detrimental residues within a peptide, "preferred" residues 

30 associated with high affmity binding to an allele-specific HLA molecule or to multiple 
HLA molecules within a superfamily are inserted. 

To ensure that an analog peptide, when used as a vaccine, actually elicits a 
CTL response to the native epitope in ^nvo (or, in the case of class II epitopes, a failure to 
elicit helper T cells that cross-react with the wild type peptides), the analog peptide may 
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be used to immunize T cells in vitro from'individuals of the appropriate HLA allele. 
Thereafter, the immunized cells' capacity to induce lysis of wild type peptide sensitized 
target cells is evaluated. In both class I and class n systems it will be desirable to use as 
targets, cells that have been either infected or transfected with the appropriate genes to 
5 establish whether endogenously produced antigen is also recognized by the relevant T 
cells. 

Another embodiment of the invention is to create analogs of weak binding 
peptides, to thereby ensure adequate numbers of cross-reactive cellular binders. Class I 
peptides exhibitmg binding affinities of 500-50000 nM, and carrying an acceptable but 

10 suboptimal primary anchor residue at one or both positions can be "fixed" by substituting 
preferred anchor residues in accordance with the respective supertype. The analog 
peptides can then be tested for crossbinding activity. 

Another embodiment for generating effective peptide analogs involves the 
substitution of residues that have an adverse impact on peptide stability or solubility in, 

15 e.g., a Uquid environment. This substitution may occur at any position of the peptide 
epitope. For example, a cysteine (C) can be substituted out in favor of gamma-amino 
butyric acid. Due to its chemical nature, cysteine has the propensity to form disulfide 
bridges and sufficiently alter the peptide structurally so as to reduce binding capacity. 
Substituting gamma-amino butyric acid for C not only alleviates this problem, but 

20 actually improves binding and crossbinding capability in certain instances (Sette et al, In: 
Persistent Viral Infections (Ahmed & Chen, eds., 1998)). Substitution of cysteine with 
gamma-amino butyric acid may occur at any residue of a peptide epitope, i.e., at either 
anchor or non-anchor positions. 



25 Expression Vectors and Construction of a Minigene 

The expression vectors of the invention contain at least one promoter 
element that is capable of expressing a transcription unit encoding the antigen of interest, 
for example, a MHC class I epitope or a MHC class 11 epitope and an MHC targeting 
sequence in the appropriate cells of an organism so that the antigen is expressed and 
30 targeted to the appropriate MHC molecule. For example, if the expression vector is 
administered to a mammal such as a human, a promoter element that functions in a 
hiraian cell is incorporated into the expression vector. An example of an expression 
vector useM for expressing the MHC class II epitopes fused to MHC class II targeting 
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sequences and the MHC class I epitopes described herein is the pEP2 vector described in 
Example IV. 

This invention relies on routine techniques in the field of recombinant 
genetics. Basic texts disclosing the general methods of use in this invention include 
5 Sambrook et ah, Molecular Cloning, A Laboratory Manual (2nd ed. 1989); ICriegler, 
Gene Transfer and Expression: A Laboratory Manual (1990); and Current Protocols in 
Molecular Biology (Ausubel et al., eds., 1994); Oligonucleotide Synthesis: A Practical 
Approach (Gait, ed., 1984); Kuijpers, Nucleic Acids Research 18(17):5197 (1994); 
Dueholm, /. Org. Chem. 59:5767-5773 (1994); Methods in Molecular Biology, volume 

10 20 (Agrawal, ed.); and Tijssen, Laboratory Techniques in Biochemistry and Molecular 
Biology— Hybridization with Nucleic Acid Probes, e.g., Part I, chapter 2 "Overview of 
principles of hybridization and the strategy of nucleic acid probe assays" (1993)). 

The minigenes are comprised of two or many different epitopes (see, e.g.. 
Tables 1-8). The nucleic acid encoding the epitopes are assembled in a mini gene 

1 5 according to standard techniques, hi general, the nucleic acid sequences encoding 
minigene epitopes are isolated using amplification techniques with oligonucleotide 
primers, or are chemically synthesized. Recombinant cloning techniques can also be used 
when appropriate. Oligonucleotide sequences are selected which either amplify (when 
using PGR to assemble the minigene) or encode (when using synthetic oligonucleotides to 

20 assemble the minigene) the desired epitopes. 

Amplification techniques using primers are typically used to amplify and 
isolate sequences encoding the epitopes of choice fi:om DNA or RNA (see U.S. Patents 
4,683,195 and 4,683,202; PCR Protocols: A Guide to Methods and Applications (hmis et 
al., eds, 1990)). Methods such as polymerase chain reaction (PCR) and ligase chain 

25 reaction (LCR) can be used to amphfy epitope nucleic acid sequences directly firom 

mRNA, from cDNA, firom genomic libraries or cDNA libraries. Restriction endonuclease 
sites can be incorporated into the primers. Minigenes amplified by the PCR reaction can 
be purified fi-om agarose gels and cloned into an appropriate vector. 

Synthetic oligonucleotides can also be used to construct minigenes. This 

30 method is performed using a series of overlapping oligonucleotides, representing both the 
sense and non-sense strands of the gene. These DNA fragments are then annealed, 
ligated and cloned. OUgonucleotides that are not conunercially available can be 
chemically synthesized according to the solid phase phosphoramidite triester method first 
described by Beaucage & Caruthers, Tetrahedron Letts. 22:1859-1862 (1981), using an 
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automated synthesizer, as described in Van Devatiter et. ah. Nucleic Acids Res. 12:6159- 
6168 (1984). Purification of oligonucleotides is by either native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in Pearson & Reanler, J. 
Chrom. 255:137-149 (1983). 
5 The epitopes of the minigene are typically subcloned into an expression 

vector that contains a strong promoter to direct transcription, as well as other regulatory 
sequences such as enhancers and polyadenylation sites. Suitable promoters are well 
known in the art and described, e.g., in Sambrook et al. and Ausubel et al. Eukaryotic 
expression systems for mammalian cells are well known in the art and are commercially 

10 available. Such promoter elements include, for example, cytomegalovirus (CMV), Rous 
sarcoma virus LTR and SV40. 

The expression vector typically contains a transcription unit or expression 
. cassette that contains all the additional elements required for the expression of the 
minigene in host cells. A typical expression cassette thus contains a promoter operably 

1 5 linked to the minigene and signals required for efficient polyadenylation of the transcript. 
Additional elements of the cassette may include enhancers and introns with functional 
splice donor and acceptor sites. 

In addition to a promoter sequence, the expression cassette can also 
contain a transcription termination region downstream of the structural gene to provide 

20 for efficient teiminaiion. The termination region may he obtained firom the same gene as 
the promoter sequence or may be obtained from different genes. 

The particular expression vector used to transport the genetic information 
into the cell is not particularly critical. Any of the conventional vectors used for 
expression in eukary otic cells may be used. Expression vectors containing regulatory 

25 elements from eukaryotic viruses are typically used in eukaryotic expression vectors, e.g., 
SV40 vectors, papilloma virus vectors, and vectors derived from Epstein Bar virus. Other 
exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, 
baculovirus pDSVE, and any other vector allowing expression of proteins under the 
direction of the SV40 early promoter, SV40 later promoter, metallothionein promoter, 

30 murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin 

promoter, or other promoters shown effective for expression in eukaryotic cells. In one 
embodiment, the vector pEP2 is used in the present invention. 

Other elements that are tj^ically included in expression vectors also 
include areplicon that functions in E. coli, a gene encoding antibiotic resistance to permit 
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selection of bacteria that harbor recombinant plasmids, and unique restriction sites in 
nonessential regions of the plasmid to allow insertion of eukaryotic sequences. The 
particular antibiotic resistance gene chosen is not critical, any of the many resistance 
genes known in the art are suitable. The prokaryotic sequences are preferably chosen 
5 such that they do not interfere with the replication of the DNA in eukaryotic cells, if 
necessary. 

Administration /« Vivo 

The invention also provides methods for stimulating an immune response 

10 by administering an expression vector of the invention to an individual. Administration 
of an expression vector of the invention for stimulating an immune response is 
advantageous because the expression vectors of the invention target MHC epitopes to 
MHC molecules, thus increasing the number of CTL and HTL activated by the antigens 
encoded by the expression vector. 

1 5 Initially, the expression vectors of the invention are screened in mouse to 

determine the expression vectors having optimal activity in stimulating a desired immune 
response. Initial studies are therefore carried out, where possible, with mouse genes of 
the MHC targeting sequences. Methods of determining the activity of the expression 
vectors of the invention are well known in the art and include, for example, the uptake of 

20 ^H-thymidine to measure T cell activation and the release of ^^Cr to measure CTL activity 
as described below in Examples II and III. Experiments similar to those described in 
Example IV are performed to determine the expression vectors having activity at 
stimulating an immune response. The expression vectors having activity are further 
tested in human. To circumvent potential adverse immunological responses to encoded 

25 mouse sequences, the expression vectors having activity are modified so that the MHC 
class II targeting sequences are derived from human genes. For example, substitution of 
the analogous regions of the human homologs of genes containing various MHC class 11 
targeting sequences are substituted into the expression vectors of the invention. 
Examples of such human homologs of genes containing MHC class n targeting sequences 

30 are shown in Figures 12 to 17. Expression vectors containing human MHC class II 

targeting sequences, such as those described in Example I below, are tested for activity at 
stimulating an immune response in human. 

The invention also relates to pharmaceutical compositions comprising a 
pharmaceutically acceptable carrier and an expression vector of the invention. 
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Phannaceuticaily acceptable carriers are well known in the art and include aqueous or 
non-aqueous solutions, suspensions and emulsions, including physiologically buffered 
saline, alcohol/aqueous solutions or other solvents or vehicles such as glycols, glycerol, 
oils such as olive oil or injectable organic esters. 
5 A pharmaceutically acceptable carrier can contain physiologically 

acceptable compounds that act, for example, to stabilize the expression vector or increase 
the absorption of the expression vector. Such physiologically acceptable compounds 
include, for example, carbohydrates, such as glucose, sucrose or dextrans, antioxidants 
such as ascorbic acid or glutathione, chelating agents, low molecular weight polypeptides, 

10 antimicrobial agents, inert gases or other stabilizers or excipients. Expression vectors can 
additionally be complexed with other components such as peptides, polypeptides and 
carbohydrates. Expression vectors can also be complexed to particles or beads that can 
be administered to an individual, for example, using a vaccine gun. One skilled in the art 
would know that the choice of a pharma'ceutically acceptable carrier, including a 

15 physiologically acceptable compound, depends, for example, on the route of 
administration of the expression vector. 

The invention further relates to methods of administering a pharmaceutical 
composition comprising an expression vector of the invention to stimulate an immune 
response. The expression vectors are administered by methods well known in the art as 

20 described in Donnelly et al. {Ann. Rev. Immunol. 15:617-648 (1997)); Feigner et al. (U.S. 
Patent No. 5,580,859, issued December 3, 1996); Feigner (U.S. Patent No. 5,703,055, 
issued December 30, 1997); and Carson et al. (U.S. Patent No. 5,619M^, issued October 
21, 1997), each of which is incorporated herein by reference. In one embodiment, the 
minigene is administered as naked nucleic acid. 

25 A pharmaceutical composition comprising an expression vector of the 

invention can be administered to stimulate an immune response in a subject by various 
routes including, for example, orally, intravaginally, rectally, or parenterally, such as 
intravenously, intramuscularly, subcutaneously, intraorbitally, intracapsularly, 
intraperitoneally, intracistemally or by passive or facilitated absorption through the skin 

30 using, for example, a skin patch or transdermal iontophoresis, respectively. Fxirthermore, 
the composition can be administered by injection, intubation or topically, the latter of 
which can be passive, for example, by direct application of an ointment or powder, or 
active, for example, using a nasal spray or inhalant. An expression vector also can be 
administered as a topical spray, in which case one component of the composition is an 
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appropriate propellant. The pharmaceutical composition also can be incorporated, if 
desired, into liposomes, microspheres or other polymer matrices (Feigner et al, U.S. 
Patent No. 5,703,055; Gregoriadis, Liposome Technology, Vols. I to III (2nd ed. 1993), 
each of which is incorporated herein by reference). Liposomes, for example, which 
5 consist of phospholipids or other lipids, are nontoxic, physiologically acceptable and 
metabolizable carriers that are relatively simple to make and administer. 

The expression vectors of the invention can be delivered to the interstitial 
spaces of tissues of an animal body (Feigner et al., U.S. Patent Nos. 5,580,859 and 
5,703,055). Administration of expression vectors of the invention to muscle is a 

10 particularly effective method of administration, including intradermal and subcutaneous 
injections and transdermal administration. Transdeimal administration, such as by 
iontophoresis, is also an effective method to deliver expression vectors of the invention to 
muscle. Epidermal administration of expression vectors of the invention can also be 
employed. Epidermal administration involves mechanically or chemically irritating the 

15 outermost layer of epidermis to stimulate an immune response to the irritant (Carson et 
al, U.S. Patent No. 5,679,647). 

Other effective methods of administering an expression vector of the 
invention to stimulate an immune response include mucosal administration (Carson etal., 
U.S. Patent No. 5,679,647). For mucosal administration, the most effective method of 

20 administration includes intranasal administration of an appropriate aerosol containing the 
expression vector and a pharmaceutical composition. Suppositories and topical 
preparations are also effective for delivery of expression vectors to mucosal tissues of 
genital, vaginal and ocular sites. Additionally, expression vectors can be complexed to 
particles and administered by a vaccine gun. 

25 The dosage to be administered is dependent on the method of 

administration and will generally be between about 0.1 pg up to about 200 \x%. For 
example, the dosage can be firom about 0.05 pg/kg to about 50 mg/kg, in particular about 
0.005-5 mg/kg. An effective dose can be determined, for example, by measuring the 
immune response after administration of an expression vector. For example, the 

30 production of antibodies specific for the MHC class 11 epitopes or MHC class I epitopes 
encoded by the expression vector can be measured by methods well known in the art, 
including ELISA or other immunological assays. In addition, the activation of T helper 
cells or a CTL response can be measured by methods well known in the art including, for 
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example, the uptake of ^H-thymidine to measure T cell activation and the release of ^'Cr 
to measure CTL activity (see Examples II and HI below). 

The pharmaceutical compositions comprising an expression vector of the 
invention can be administered to mammals, particularly humans, for prophylactic or 
therapeutic purposes. Examples of diseases that can be treated or prevented using the 
expression vectors of the invention include infection with HBV, HCV, HIV and CMV as 
well as prostate cancer, renal carcinoma, cervical carcinoma, lymphoma, condyloma 
acuminatmn and acquired inmixmodeficiency syndrome (AIDS). 

In therapeutic appUcations, the expression vectors of the invention are 
administered to an individual already suffering from cancer, autoimmune disease or 
infected with a vims. Those in the incubation phase or acute phase of the disease can be 
treated with expression vectors of the invention, including those expressing all universal 
MHC class 11 epitopes, separately or in conjunction with other treatments, as appropriate. 

In therapeutic and prophylactic applications, pharmaceutical compositions 
comprising expression vectors of the invention are administered to a patient in an amount 
sufficient to elicit an effective immune response to an antigen and to ameliorate the signs 
or symptoms of a disease. The amount of expression vector to administer that is 
sufficient to ameliorate the signs or symptoms of a disease is termed a therapeutically 
effective dose. The amount of expression vector sufficient to achieve a therapeutically 
effective dose will depend on the pharmaceutical composition comprising an expression • 
vector of the invention, the manner of administration, the state and severity of the disease 
being treated, the weight and general state of health of the patient and the judgment of the 
prescribing physician. 



25 All publications and patent applications cited in this specification are 

herein incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be readily 

30 apparent to one of ordinary skill in the art in light of the teachings of this invention that 
certain changes and modifications may be made thereto without departing from the spirit 
or scope of the appended claims. 
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EXAMPLES 

The following example is provided by way of illustration only and not by 
way of limitation. Those of skill in the art will readily recognize a variety of noncritical 
parameters that could be changed or modified to yield essentially similar results. 

5 

EXAMPLE I: Construction of Expression Vectors Containing MHC Class 11 Epitopes 
This example shows construction of expression vectors containing MHC 

class II epitopes that can be used to target antigens to MHC class II molecules. 

Expression vectors comprising DNA constructs were prepared using 
1 0 overlapping oligonucleotides, poljraerase chain reaction (PGR) and standard molecular 

biology techniques (Dieffenbach & Dveksler, PCR Primer: A Laboratory Manual (1995); 

Sambrook et ah. Molecular Cloning: A Laboratory Manual (2nd ed., 1989), each of 

which is incorporated herein by reference). 

To generate full length wild type li, the full length invariant chain was 
15 amplified, cloned, and sequenced and used in the construction of the three invariant chain 

constructs. Except where noted, the source of cDNA for all the constructs listed below 

was Mouse Spleen Marathon-Ready cDNAmade from Balb/c males (Clontech; Palo Alto 

CA). The primer pairs were the ohgonucleotide 

GCTAGCGCCGCCACCATGGATGACCAACGCGACCTC (SEQ ID NO:40), which is 
20 designated murli-F and contains an Nhel site followed by the consensus Kozak sequence 
and the 5 ' end of the li cDNA; and the oligonucleotide 

GGTACCTCACAGGGTGACTTGACCCAG (SEQ ID N0:41), which is designated 
murli-R and contains a Kpnl site and the 3' end of the li coding sequence. 

For the PCR reaction, 5 \i\ of spleen cDNA and 250 nM of each primer 

25 were combined in a 100 jxl reaction with 0.25 mM each dNTP and 2.5 units of Pfu 

polymerase in Pfu polymerase buffer containing 10 mM KCl, 1 0 mM (NH4)2S04, 20 mM 
Tris-chloride, pH 8.75, 2 mMMgS04, 0.1% TRITON X-100 and 100 ^g/ml bovine serum 
albumin (BS A). A Perkin/Ehner 9600 PCR machine (Perkin Elmer; Foster City CA) was 
used and the cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 30 

30 cycles of 95'*C for 15 seconds, 52''C for 30 seconds, and 72°C for 1 minute. The PCR 
reaction was run on a 1% agarose gel, and the 670 base pair product was cut out, purified 
by spinning through a Millipore Ultrafree-MC filter (Millipore; Bedford MA) and cloned 
into pCR-Blunt from Invitrogen (San Diego, CA). Individual clones were screened by 
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sequencing, and a correct clone (named bli#3) was used as a template for the helper 
constructs. 

DNA constructs containing pan DR epitope sequences and MHC II 
targeting sequences derived from the li protein were prepared. The li murine protein has 
5 been previously described (Zhu & Jones, Nucleic Acids Res. 17:447-448 (1989)), which is 
incorporated herein by reference. Briefly, the liPADRE construct contains the full length 
li sequence with PADRE precisely replacing the CLIP region. The DNA construct 
encodes amino acids 1 through 87 of invariant chain, followed with the 13 amino acid 
PADRE sequence (SEQ ID NO:38) and the rest of the invariant chain DNA sequence 
10 (amino acids 101-215). The construct was amplified in 2 overlapping halves that were 
joined to produce the final construct. The two primers used to amplify the 5* half were 
murli-F and the oligonucleotide 

CAGGGTCCAGGCAGCCACGAACTTGGCCACAGGTTTGGCAGA (SEQ ID 
NO:42), which is designated liPADRE-R. The liPADRE-R primer includes nucleotides 

1 5 3 03-262 of liPADRE. The 3 ' half was amplified with the primer 

GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID 
NO:43), which is designated liPADRE-F and includes nucleotides 288-330 of liPADRE; 
and murli-R. The PGR conditions were the same as described above, and the two halves 
were isolated by agarose gel electrophoresis as described above. 

20 Ten microliters of each PGR product was combined in a 1 GO \il PGR 

reaction with an anneaUng temperature of 50*'C for five cycles to generate a full length 
template. Primers murli-F and raurli-R were added and 25 more cycles carried out. The 
full length liPADRE product was isolated, cloned, and sequenced as described above. 
This construct contains the murine li gene with a pan DR epitope sequence substituted for 

25 the CLIP sequence of li (Figure 1 ). 

A DNA construct, designated I80T, containing the cytoplasmic domain, the 
transmembrane domain and part of the luminal domain of li fused to a string of multiple 
MHC class II epitopes was constructed (Figure 2). Briefly, the string of multiple MHC 
class II epitopes was constructed with three overlapping oligonucleotides (oligos). Each 

30 oligo overlapped its neighbor by 1 5 nucleotides and the fmal MHC class 11 epitope string 
was assembled by extending the overlapping oligonucleotides in three sets of reactions 
using PCR. The three oligonucleotides were; oligo 1, nucleotides 241-310, 
CTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTCACGCCGAAATCAA 
CGAAGCTGGAAGAACCC (SEQ ED NO:44); 
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oligo 2, nucleotides 364-295, 
TTCTGGTCAGCAGAAAGAACAGGATAGGAGCGTTTGGAGGGCGATAAGCTGG 
AGGGGTTCTTCCAGCTTC (SEQ ID NO:45); and 

oligo 3, nucleotides 350-42, 
5 TTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTG 
GCTGCCTGGACCCTGAAG (SEQ ID NO:46). 

For the first PGR reaction, 5 jag of oligos 1 and 2 were combined in a 100 
|j,l reaction containing P/m poljmierase. A Perkin/Elmer 9600 PGR machine was used and 
the annealing temperature used was 45° C. The PGR product was gel-purified, and a 
10 second reaction containing the PGR product of oligos 1 and 2 with ohgo 3 was annealed 
and extended for 10 cycles before gel purification of the full length product to be used as 
a "mega-primer." 

The 18 OT construct was made by amplifying bli#3 with murli-F and the 
mega-primer. The cycling conditions were: 1 cycle of 95°C for 5 minutes, followed by 5 
15 cycles of 95°C for 15 seconds, 37°C for 30 seconds, and 72''C for 1 minute. Primer Help- 
epR was added and an additional 25 cycles were carried out with the annealing 
temperature raised to 47°C. The Help-epR primer 

GGTACCTCAAGCGGCAGCCTTCAGGGTCCAGGCA(SEQ ID NO:47) corresponds 
to nucleotides 438-405. The full length I80T product was isolated, cloned, and sequenced 
20 as above. 

The I80T construct (Figure 2) encodes amino acid residues 1 through 80 of 
li, containing the cytoplasmic domain, the transmembrane domain and part of the luminal 
domain, fiased to a string of multiple MHC class 11 epitopes corresponding to: amino acid 
residues 323-339 of ovalbumin 
25 (IleSerGhiAlaValHisAlaAlaHisAlaGluIleAsnGluAIaGlyArg; SEQ ID NO:48); amino 
acid residues 128 to 141 of HBV core antigen (amino acids 

ThrProProAlaTyrArgProProAsnAlaProIIeLeu; SEQ ID NO:49); amino acid residues 182 
to 196 of HBV env (amnio acids PhePheLeuLeuThrArglleLeuThrlleProGlnSerLeuAsp; 
SEQ ID NO:50); and the pan DR sequence designated SEQ ID NO:38> 
30 A DNA constmct containing the cytoplasmic domain, transmembrane 

domain and a portion of the luminal domain of li fused to the MHC class II epitope string 
shown in Figure 2 and amino acid residues 101 to 215 of li encoding the trimerization 
region of li was generated (Figure 3). This construct, designated liThfuU, encodes the 
first 80 amino acids of invariant chain followed by the MHC class II epitope string 
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(replacing CLIP) and the rest of the invariant chain (amino acids 101-215). Briefly, the 
construct was generated as two overlapping halves that were annealed and extended by 
PGR to yield the final product. 

The 5' end of liThfull was made by amplifying I80T with murli-F (SEQ 
5 ID NO:40) and Th-Pad-R. The Th-Pad-R primer AGCGGCAGCCTTCAGGGTC (SEQ 
ID N0:51) corresponds to nucleotides 429-411. The 3' half was made by ampUfying 
bli#3 with liPADRE-F and murli-R (SEQ ID N0:41). The liPADRE-F primer 
GGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAAC (SEQ ID NO:52) 
corresponds to nucleotides 402-444. Each PGR product was gel purified and mixed, then 

10 denatured, annealed, and extended by five cycles of PGR. Primers murli-F (SEQ ID 

NO:40) and murli-R (SEQ ID N0:41) were added and another 25 cycles performed. The 
full length product was gel purified, cloned, and sequenced. 

All of the remaining constructs described below were made essentially 
according to the scheme shown in Figure 18. Briefly, primer pairs IF plus IR, designated 

15 below for each specinc construct, were used to amplify the specific signal sequence and 
contained an overlapping 15 base pair tail identical to the 5' end of the MHC class II 
epitope string. Primer pair Th-ova-F, ATCAGCCAGGCTGTGCACGC (SEQ ID NO:53), 
plus Th-Pad-R (SEQ ID N0:51) were used to amplify the MHC class II epitope string. A 
15 base pair overlap and the specific transmembrane and cytoplasmic tail containing the 

20 targeting signals were amplified with primer pairs 2F plus 2R. 

All three pieces of each cDNA were amplified using the following 
conditions: 1 cycle of 95°C for 5 minutes, followed by 30 cycles of 95°C for 15 seconds, 
52°C for 30 seconds, and 72°C for 1 minute. Each of the three fragments was agrose-gel 
purified, and the signal sequence and MHC class II string firagments were combined and 

25 joined by five cycles in a second PGR. After five cycles, primers IF and Th-Pad-R were 
added for 25 additional cycles and the PGR product was gel piuified. This signal 
sequence plus MHC class II epitope string fi-agment was combined with the 
transmembrane plus cytoplasmic tail firagment for the final PGR. After five cycles, 
primers IF plus 2R were added for 25 additional cycles and the product was gel purified, 

30 cloned and sequenced. 

ADKA construct containing the murine immnnoglobulin kappa signal 
sequence fiised to the T helper epitope string shown in Figure 2 and the transmembrane 
and cytoplasmic domains of LAMP-1 was generated (Figure 4) (Granger et al, J. Biol. 
Chem. 265:12036-12043 (1990)), which is incorporated by reference (mouse LAMP-1 
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GenBank accession No. M32015). This construct, designated kappaLAMP-Th, contains 
the consensus mouse immunoglobulin kappa signal sequence and was amplified from a 
plasmid containing flill length immxmoglobulin kappa as depicted in Figure 18. The 
primer IF used was the oligonucleotide designated KappaSig-F, 
5 GCTAGCGCCGCCACCATGGGAATGCAG (SEQ ID NO:54). 

The primer IR used was the oligonucleotide designated Kappa-Th-R, 
CACAGCCTGGCTGATTCCTCTGGACCC (SEQ ED NO:55). 

The primer 2F used was the oligonucleotide designated PAD/LAMP-F, 
CTGAAGGCTGCCGCTAACAACATGTTGATCCCC (SEQ ID NO:56). The primer 2R 

10 used was the oligonucleotide designated LAMP-CYTOR, 
GGTACCCTAGATGGTCTGATAGCC (SEQ ID NO:57). 

ADNA construct containing the signal sequence of H2-M fused to the 
MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic 
domains of H2-M was generated (Figure 5). The mouse H2-M gene has been described 

15 previously, Peleraux et al, Immunogenetics 43:204-214 (1996)), which is incorporated 
herein by reference. This construct was designated H2M-Th and was constructed as 
depicted in Figure 18. The primer IF used was the oHgonucleotide designated H2-Mb- 
IF, GCC GCT AGC GCC GCC ACC ATG GCT GCA CTC TOG (SEQ ID NO:58). The 
primer IRused was the oligonucleotide designated H2-Mb-1R, CAC AGC CTG GCT 

20 GATCCCCATACAGTGCAG(SEQIDNO:59). The primer 2F used was the 

oligonucleotide designated H2-Mb-2F, CTG AAG GCT GCC GCT AAG GTC TCT GTG 
TCT (SEQ ID NO:60). The primer 2R used was the oligonucleotide designated H2-Mb- 
2R, GCG GGT ACC CTA ATG CCG TCC TTC (SEQ ID N0:61). 

ADNA construct containing the signal sequence of H2-DO fused to the 

25 MHC class II epitope string shown in Figure 2 and the transmembrane and cytoplasmic 
domains of H2-D0 was generated (Figure 6). The mouse H2-D0 gene has been 
described previously (Larhammar et al. J. Biol. Chem. 260:14111-14119 (1985)), which 
is incorporated herein by reference (GenBank accession No. M19423). This construct, 
designated H20-Th, was constructed as depicted in Figure 18. The primer IF used was 

30 the oHgonucleotide designated H2-0b-lF, GCG GCT AGC GCC GCC ACC ATG GGC 
GCT GGG AGG (SEQ ID NO:62). The primer IR used was the oligonucleotide 
designated H2-Ob-lR, TGC AC A GCC TGG CTG ATG GAATCC AGC CTC (SEQ ID 
NO:63). The primer 2F used was the oligonucleotide designated H2-Ob-2F, CTG AAG 
GCT GCC GCT ATA CTG AGT GGA GCT (SEQ ID NO:64). The primer 2R used was 
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the oligonucleotide designated H2-Ob-2R, GCC GGT ACC TCA TOT GAG ATG TCC 
CG (SEQ ID NO:65). 

A DNA construct containing a pan DR epitope sequence {SEQ ID NO:38) 
fiised to the amino-terminus of influenza matrix protein is generated (Figure 7). This 
5 construct, designated PADRE-Influenza matrix, contains the universal MHC class II 
epitope PADRE attached to the amino terminus of the influenza matrix coding sequence. 
The construct is made using a long primer on the 5' end primer. The 5' primer is the 
oligonucleotide 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
10 CGCTATGAGTCTTCTAACCGAGGTCGA (SEQ ID NO:66). The 3' primer is the 
oligonucleotide TCACTTGAATCGCTGCATCTGCACCCCCAT (SEQ ID NO:67). 
Influenza virus from the America Type Tissue Collection (ATCC) is used as a source for 
the matrix coding region (Perdue et al Science 279:393-396 (1998)), which is 
incorporated herein by reference (GenBank accession No. AF036358). 
1 5 A DNA construct containing a pan DR epitope sequence (SEQ ID NO:38) 

fused to the amino-terminus of HBV-S antigen was generated (Figure 8). This construct 
is designated PADRE -HBV-s and was generated by annealing two overlapping 
oligonucleotides to add PADRE onto the amino terminus of hepatitis B surface antigen 
(Michel et al, Proc. Natl. Acad. Sci. USA 81:7708-7712 (1984); Michel et al. Pmc. Natl. 
20 Acad, Sci. USA 92:5307-5311 (1995)), each of which is incorporated herein by reference. 
One oligonucleotide was 

GCTAGCGCCGCCACCATGGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGC 
CGCTC (SEQ ID NO:68). The second oligonucleotide was 

CTCGAGAGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGGCCATGGTG 
25 GCGGCG (SEQ ID NO:69). When annealed, the oligos have Nhel and Xhol cohesive 
ends. The oligos were heated to 100°C and slowly cooled to room temperature to mtieal. 
A three part ligation joined PADRE with an Xhol-Kpnl fragment containing HBV-s 
antigen into the Nhel plus Kpnl sites of the expression vector. 

A DNA construct containing the signal sequence of Ig-a fused to the MHC 
30 class II epitope string shown in Figme 2 and the transmembrane and cytoplasmic domains 
of Ig-a was generated (Figure 9). The mouse Ig-a gene has been described previously 
(Kashiwamura et al., J. Immunol. 145:337-343 (1990)), which is incorporated herein by 
reference (GenBank accession No. M31773). This construct, designated Ig-alphaTh, was 
constructed as depicted in Figure 18. The primer IF used was the oligonucleotide 
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designated Ig alpha-lF, GCG GCT AGC GCC GCC ACC ATG CCA GGG GGT CTA 
(SEQ ID NO:70). The primer IR used was the oligonucleotide designated Igalpha-IR, 
GCA CAG CCT GGC TGATGG CCT GGC ATC CGG (SEQ ID NO:71). The primer 2F 
used was the oligonucleotide designated Igalpha-2F, CTG AAG GCT GCC GCT GGG 
5 ATC ATC TTG CTG (SEQ ID NO:72). The primer 2R used was the oligonucleotide 
designated Igalpha-2R, GCG GGT ACC TCATGG CTT TTC CAG CTG (SEQ ID 
NO:73). 

ADNA construct containing the signal sequence of Ig-p fused to the MHC 
class II string shown in Figure 2 and the transmembrane and cytoplasmic domains of IgP 

10 was generated (Figure 10). The Ig-p sequence is the B29 gene of mouse and has been 
described previously (Hermanson et al., Proc. Natl. Acad. Sci. USA 85:6890-6894 
(1988)), which is incorporated herein by reference (GenBank accession No. J03857). 
This construct, designated Ig-betaTh, was constructed as depicted in Figure 18. The 
primer IF used was the oligonucleotide designated B29-1F (33mer) GCG GCT AGC 

15 GCC GCC ACC ATG GCC ACA CTG GTG (SEQ ID NO:74). The primer IR used was 
the oligonucleotide designated B29-1R (30mer) CAC AGC CTG GCT GAT CGG CTC 
ACC TGA GAA (SEQ ID NO:75). The primer 2F used was the oligonucleotide 
designated B292F (30mer) CTG AAG GCT GCC GCT ATT ATC TTG ATC CAG (SEQ 
ID NO: 76). The primer 2R used was the oUgonucleotide designated B29-2R (27mer), 

20 GCC GGT ACC TCA TTC CTG GCC TGG ATG (SEQ ID NO:77). 

ADNA construct containing the signal sequence of the kappa 
immunoglobulin signal sequence fused to the MHC class II epitope string shown in 
Figure 2 was constructed (Figure 11). This construct is designated SigTh and was 
generated by using the kappaLAMP-Th construct (shown in Figure 4) and amplifying 

25 with the primer pair KappaSig-F (SEQ ED NO:54) plus Help-epR (SEQ ID NO:47) to 
create SigTh. SigTh contains the kappa immunoglobulin signal sequence fused to the T 
helper epitope string and terminated with a translational stop codon. 

Constructs encoding human sequences corresponding to the above 
described constructs having mouse sequences are prepared by substituting human 

30 sequences for the mouse sequences. Briefly, for the liPADRE construct, corresponding to 
Figure 1, amino acid residues 1-80 from the human li gene HLA-DR sequence (Figure 
12) (GenBank accession No. X00497 M14765) is substituted for the mouse li sequences, 
which is fused to PADRE, followed by human invariant chain HLA-DR amino acid 
residues 114-223. For the I80T construct, corresponding to Figure 2, amino acid residues 
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1-80 from the human sequence of li is followed by a MHC class II epitope string. For the 
liThfixll construct, corresponding to Figure 3, amino acid residues 1-80 from the human 
sequence of li, which is fused to a MHC class 11 epitope string, is followed by human 
invariant chain amino acid residues 114-223. 
5 For the LAMP-Th construct, similar to Figure 4, the signal sequence 

encoded by amino acid residues 1-19 (nucleotides 11-67) of human LAMP-1 (Figure 13) 
(GenBank accession No. J04182), which is fiised to the MHC class II epitope string, is 
followed by the transmembrane (nucleotides 1163-1213) and cytoplasmic tail 
(nucleotides 1214-1258) region encoded by amino acid residues 380-416 of human 
10 LAMP-1. 

For the HLA-DM-Th construct, corresponding to Figure 5, the signal 
sequence encoded by amino acid residues 1-17 (nucleotides 1-51) of human HLA-DMB 
(Figure 14) (GenBank accession No. U15085), which is frised to the MHC class II epitope 
string, is followed by the transmembrane (nucleotides 646-720) and cytoplasmic tail 
15 (nucleotides 721-792) region encoded by amino acid residues 216-263 of human HLA- 
DMB. 

For the HLA-DO-Th construct, corresponding to Figure 6, the signal 
sequence encoded by amino acid residues 1-21 (nucleotides 1-63) of human HLA-DO 
(Figure 15) (GenBank accession No. L29472 J02736 N00052), which is fused to the 

20 MHC class II epitope string, is followed by the transmembrane (nucleotides 685-735) and 
cytoplasmic tail (nucleotides 736-819) region encoded by amino acid residues 223-273 of 
human HLA-DO. 

For the Ig-alphaTh constmct, corresponding to Figure 9, the signal 
sequence encoded by amino acid residues 1-29 (nucleotides 1-87) of human Ig-a MB-1 

25 (Figure 16) (GenBank accession No. U05259), which is fiised to the MHC class II epitope 
string, is followed by the transmembrane (nucleotides 424-498) and cytoplasmic tail 
(nucleotides 499-678) region encoded by amino acid residues 142-226 of human Ig-a 
MB-i. 

For the Ig-betaTh construct, corresponding to Figure 10, the signal 
30 sequence encoded by amino acid residues 1-28 (nucleotides 17-100) of human Ig-p B29 
(Figure 17) (GenBank accession No. M80461), which is fiised to the MHC class II 
epitope string, is followed by the transmembrane (nucleotides 500-547) and cytoplasmic 
tail (nucleotides 548-703) region encoded by amino acid residues 156-229 of human Ig-p. 
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The SigTh construct shown in Figure 11 can be used in mouse and human. 
Alternatively, a signal sequence derived from an appropriate human gene containing a 
signal sequence can be substituted for the mouse kappa immunoglobulin sequence in the 
'Sig Th construct. 

5 The PADRE-Influenza matrix constmct shown in Figure 7 and the 

PADRE-HBVs construct shown in Figure 8 can be used in mouse and human. 

Some of the DNA constructs described above were cloned into the vector 
pEP2 (Figure 19; SEQ ID NO:35). The pEP2 vector was constructed to contain dual 
CMV promoters. The pEP2 vector used the backbone of pcDNA3. l(-)Myc-His A from 

10 Invitrogen and pIRESlhyg from Clontech. Changes were made to both vectors before the 
CMV transcription unit from pIRESlhyg was moved into the modified pcDNA vector. 

The pcDNA3.1(-)Myc-His A vector (http://www.invitrogen.com) was 
modified. Briefly, the PvuII fragment (nucleotides 1342-3508) was deleted. ABspHI 
fragment that contains the AmpicilHn resistance gene (nucleotides 4404-5412) was cut 

15 out. The Ampicillin resistance gene was replaced with the kanamycin resistance gene 

from pUC4K (GenBank Accession #X06404). pUC4K was ampUfied with the primer set: 
TCTGATGTTACATTGCACAAG (SEQ ID NO:78) (nucleotides 1621-1601) and 
GCGCACTCATGATGCTCTGCCAGTGTTACAACC (SEQ ID NO:79) (nucleotides 
682-702 plus the addition of a BspHI restriction site on the 5 ' end). The PGR product 

20 was digested with BspHI and hgated into the vector digested with BspHI. The region 
between the Pmel site at nucleotide 905 and the EcoRV site at nucleotide 947 was 
deleted. The vector was then digested with Pmel (cuts at nucleotide 1 076) and Apal (cuts 
at nucleotide 1004), Klenow filled in at the cohesive ends and ligated. The Kpnl site at 
nucleotide 994 was deleted by digesting with Kpnl and filling in the ends with Klenow 

25 DNA polymerase, and ligating. The intron A sequence from CMV (GenBank accession 
M21295, nucleotides 635-1461) was added by amplifying CMV DNA with the primer set: 
GCGTCTAGAGTAAGTACCGCCTATAGACTC (SEQ ID NO:80) (nucleotides 635-655 
plus an Xbal site on the 5' end) and CCGGCTAGCCTGCAGAAAAGACCCATGGAA 
(SEQ ID N0;81) (nucleotides 1461-1441 plus an Nhel site on the 3' end). The PGR 

30 product was digested with Xbal and Nhel and ligated into the Miel site of the vector 

(nucleotide 895 of the original pcDNA vector) so that the Nhel site was on the 3' end of 
the intron. 

To modify the pIRESlhyg vector (GenBank Accession U89672, 
Clontech), the Kpnl site (nucleotide 911) was deleted by cutting and filling in with 
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Klenow. The plasmid was cut with NotI (nucleotide 1254) and Xbal (nucleotide 3 196) 
and a polylinker oligo was inserted into the site. The polylinker was formed by annealing 
the following two oligos: 

GGCCGCAAGGAAAAAATCTAGAGTCGGCCATAGACTAATGCCGGTACCG (SEQ 
5 IDNO:82}and 

CTAGCGGTACCGGCATTAGTCTATGGCCCGACTCTAGATTTTTTCCTTGC (SEQ 
ID NO:83). The resulting plasmid was cut with Hindi and the fragment between Hindi 
sites 234 and 3538 was isolated and Hgated into the modified pcDNA vector. This 
fragment contains a CMV promoter, intron, polylinker, and polyadenylation signal. 

1 0 The pIREShyg piece and the pcDNA piece were combined to form pEP2. 

The modified pcDNA3.1(-)Myc-His A vector was partially digested with PvuII to isolate 
a linear fragment with the cut downstream of the pcDN A polyadenylation signal (the 
other PvuII site is the CMV intron). The Hindi fragment from the modified pIRESlhyg 
vector was ligated into the PvuII cut vector. The polyadenylation signal from the pcDNA 

1 5 derived transcription unit was deleted by digesting with EcoRI (pcDNA nucleotide 955) 
and Xhol (pIRESlhyg nucleotide 3472) and replaced with a synthetic polyadenylation 
sequence. The synthetic polyadenylation signal was described in Levitt et al. Genes and 
Development 3:1019-1025 (1989)). 

Two oligos were annealed to produce a fragment that contained a 

20 polylinker and polyadenylation signal with EcoRI and Xhol cohesive ends. The oligos 
were: 

AATTCGGATATCC..i^GCTTGATGAATAAAAGATCAGAGCTCTAGTGATCTGTGT 
GTTGGTTTTTTTGTGTGC (SEQ ID NO:84) and 

TCGAGCACACAAAAAACCAACACACAGATCACTAGAGCTCTGATCTTTTTATT 

25 CATCAAGCTTGGATATCCG (SEQ ID NO:85). 

The resulting vector is named pEP2 and contains two separate 
franscription units. Both transcription units use the same CMV promoter but each 
contains different intron, polylinker, and polyadenylation sequences. 

The pEP2 vector contains two transcription imits. The first franscription 

30 unit contains the CMV promoter initially from pcDNA (nucleotides 2 1 0-862 in Figure 
19), CMV intron A sequence (nucleotides 900-1728 in Figure 19), polylinker cloning site 
(nucleotides 1740-1760 in Figure 19) and synthetic polyadenylation signal (nucleotides 
1764-1769 in Figure 19). The second franscription unit, which was initially derived from 
pIRESlhyg, contains the CMV promoter (nucleotides 3 165-2493 in Figure 19), intron 
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sequence (nucleotides 2464-2173 in Figure 19), polylinker clone site (nucleotides 2126- 
2095 in Figure 19) and bovine growth hormone polyadenylation signal (nucleotides 1979- 
1974 in Figure 19). The kanamycin resistance gene is encoded in nucleotides 4965-4061 
(Figure 19). 

5 The DNA constructs described above were digested with Nhel and Kpnl 

and cloned into the Xbal and Kpnl sites of pEP2 (the second transcription unit). 

Additional vectors were also constructed. To test for the effect of co- 
expression of MHC class I epitopes with MHC class 11 epitopes, an insert was generated, 
designated AOS, that contains nine MHC class I epitopes. The AOS insert was initially 

1 0 constructed in the vector pMIN.O (Figure 20; SEQ ID N0:3 6). Briefly, the AOS insert 
contains nine MHC class I epitopes, six restricted by HLA-A2 and three restricted by 
HLA-Al 1, and the universal MHC class II epitope PADRE. The vector pMIN.O contains 
epitopes from HBV, HIV and a mouse ovalbumin epitope. The MHC class I epitopes 
appear in pMIN.O in the following order: 

1 5 consensus mouse Ig Kappa signal sequence (pMIN.O amino acid residues 

1-20, nucleotides 16-81) MQVQIQSLFLLLLWVPGSRG (SEQ ID NO:86) encoded by 
nucleotides ATG CAG GTG CAG ATC CAG AGC CTG TTT CTG CTC CTC CTG TGG 
GTG CCC GGG TCC AGA GGA (SEQ ID NO:87); 

HBV pol 149-159 (All restricted) 

20 (pMIK.O amino acid residues 21-31, nucleotides 82-1 14) 

HTLWKAGILYK (SEQ ID NO:88) encoded by nucleotides CAC ACC CTGTGG AAG 
GCC GGA ATC CTG TAT AAG (SEQ ID NO:89); 

PADRE-universal MHC class II epitope (pMIN.O amino acid residues 32- 
45, nucleotides 115-153) AKFVAAWTLKAAA (SEQ ID NO:38) encoded by nucleotides 

25 GCC AAG TTC GTG GCT GCC TGG ACC CTG AAG GCT GCC GCT (SEQ ID 
NO:90); 

HBV core 18-27 (A2 restricted) (pMIN.O amino acid residues 46-55, 
nucleotides 154-183) FLPSDFFPSV (SEQ ID N0:91) encoded by nucleotides TTC CTG 
CCT AGC GAT TTC TTT CCT AGC GTG (SEQ ID NO:92); 
30 HIV env 120-128 (A2 restricted) (pMIN.O amino acid residues 56-64, 

nucleotides 184-210) KLTPLCVTL (SEQ ID NO:93) encoded by nucleotides AAG CTG 
ACC CCA CTG TGC GTG ACC CTG (SEQ ID NO:94); 
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HBV pol 551-559 {A2 restricted) (pMIN.O amino acid residues 65-73, 
nucleotides 211-237) YMDDVVLGA (SEQ ID NO:95) encoded by nucleotides TAT ATG 
GAT GAC GTG GTG CTG GGAGCC (SEQ ID NO:96); 

mouse ovalbumin 257-264 (K'' restricted) (pMIN.O amino acid residues 
5 74-8 1, nucleotides 238-26 1) SIINFEKL (SEQ ID NO;97) encoded by nucleotides AGC 
ATC ATC AAC TTC GAG AAG CTG (SEQ ID NO:98); 

HBV pol 455-463 (A2 restricted) (pMIN.O amino acid residues 82-90, 
nucleotides 262-288) GLSRYVARL (SEQ ID NO:99) encoded by nucleotides GGA CTG 
TCC AGA TAG GTG GCT AGG CTG (SEQ ID NO: 1 00); 
1 0 HIV pol 476-84 (A2 restricted) (pMIN.O amino acid residues 9 1 -99, 

nucleotides 289-3 15) ILKEPVHGV (SEQ ID NO: 101) encoded by nucleotides ATC CTG 
AAG GAG CCT GTG CAC GGC GTG (SEQ ID NO: 102); 

HBV core 141-151 (All restricted) 

(pMIN.O amino acid residues 100-110, nucleotides 316-348) 
15 STLPETTVVRR (SEQ ID NO:103) encoded by nucleotides TCC ACC CTG CCA GAG 
ACC ACC GTG GTG AGG AGA (SEQ ID NO: 1 04); 

HIV env 49-58 (All restricted) (pMIN.O amino acid residues 111-120, 
nucleotides 349-378) TVYYGVPVWK (SEQ ID NO: 105) encoded by nucleotides ACC 
GTG TAG TAT GGA GTG CCT GTG TGG AAG (SEQ ID NO: 106); and 
20 HBV env 335-343 (A2 restricted) (pMIN.O amino acid residues 121-129, 

nucleotides 378-405) WLSLLVPFV (SEQ ID NO: 107) encoded by nucleotides TGG 
CTG AGC CTG CTG GTG CCC TTT GTG (SEQ ID NO:108). 

The pMIN.O vector contains a Kpnl restriction site (pMIN.O nucleotides 
406-41 1) and a Nhel restriction site (pMIN.O nucleotides 1-6). The pMIN.O vector 
25 contains a consensus Kozak sequence (nucleotides 7-1 8) (GCCGCCACCATG; SEQ ID 
NO:109) and murine Kappa Ig-light chain signal sequence followed by a string of 10 
MHC class I epitopes and one universal MHC class 11 epitope. The pMIN.O sequence 
encodes an open reading frame fused to the Myc and His antibody epitope tag coded for 
by the pcDNA 3.1 Myc-His vector. The pMIN.O vector was constructed with eight 
30 oligonucleotides: 

Mini oligo 

GAGGAGCAGAAACAGGCTCTGGATCTGCACCTGCATTCCCATGGTGGCGGCGC 
TAGCAAGCTTCTTGCGC (SEQ ID NO:110); 
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Min2 oligo 

CCTGTTTCTGCTCCTCCTGTGGGTGCCCGGGTCCAGAGGACACACCCTGTGGA 
AGGCCGGAATCCTGTATA (SEQ ID NO: 1 1 1 ); 
Min3 oligo 

5 TCGCTAGGCAGGA.\AGCGGCAGCCTTCAGGGTCCAGGCAGCCACGAACTTGG 
CCTTATACAGGATTCCGG (SEQ ID N0:112); 
Min4 oligo 

CTTTCCTGCCTAGCGATTTCTTTCCTAGCGTGAAGCTGACCCCACTGTGCGTGA 
CCCTGTATATGGATGAC (SEQ ID NO: 11 3); 
10 Min5 oligo 

CGTACCTGGACAGTCCCAGCTTCTCGAAGTTGATGATGCTGGCT 
CCCAGCACCACGTCATCCATATACAG (SEQ ID N0:1 14); 

Min6 oligo 

GGACTGTCCAGATACGTGGCTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGT 
15 GTCCACCCTGCCAGAGAC (SEQ ID NO:l 15); 
Min7 oligo 

GCTCAGCCACTTCCACACAGGCACTCCATAGTACACGGTCCTCCTCACCACGG 
TGGTCTCTGGCAGGGTG (SEQ ID NO: 11 6); 
MinS oligo 

20 GTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACCTGATCTAGAGC 
(SEQ ID NO: 11 7). 

Additional primers were flanking primer 5', GCG CAA GAA GCT TGC 
TAG CG (SEQ ID NO: 1 1 8) and flanking primer 3 GCT CTA GAT CAG GTA CCC 
CAC (SEQ ID NO: 11 9). 

25 The original pMIN.O minigene construction was carried out using eight 

overlapping oligos averaging approximately 70 nucleotides in length, which were 
synthesized and HPLC purified by Operon Technologies Inc. Each oligo overlapped its 
neighbor by 15 nucleotides, and the final multi-epitope minigene was assembled by 
extending the overlapping oligos in three sets of reactions using PGR (Ho et al, Gene 

30 77:51-59(1989). 

For the first PGR reaction, 5 p,g of each of two oligos were annealed and 
extended: 1+2, 3+4, 5+6, and 7+8 were combined in 100 |xl reactions containing 0.25 mM 
each dNTP and 2.5 units of Pfu polymerase in Pfu polymerase buffer containing 10 mM 
KCl, 10 mM (NH4)2S04, 20 mM Tris-chloride, pH 8.75, 2 mM MgS04, 0.1% TRITON 
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X-100 and 100 mg/ml BSA. APerkin/Elmer 9600 PGR machine was used and the 
annealing temperature used was 5°C below the lowest calculated Tm of each primer pair. 
The fiili length dimer products were gel-purified, and two reactions containing the 
product of 1-2 and 3-4, and the product of 5-6 and 7-8 were mixed, annealed and 
5 extended for 10 cycles. Half of the two reactions were then mixed, and 5 cycles of 
annealing and extension carried out before flanking primers were added to amplify the 
full length product for 25 additional cycles. The full length product was gel purified and 
cloned into pCR-blunt (Invitrogen) and individual clones were screened by sequencing. 
The Min insert was isolated as an Nhel-Kpnl fragment and cloned into the same sites of 

10 pcDNA3.1(-)/Myc-His A (Invitrogen) for expression. The Min protein contains the Myc 
and His antibody epitope tags at its carboxyl-terminal end. 

For all the PGR reactions described, a total of 30 cycles were performed 
using Pfii polymerase and the following conditions: 95°C for 15 seconds, annealing 
temperature for 30 seconds, 72°C for one minute. The annealing temperature used was 

15 5°G below the lowest calculated Tm of each primer pair. 

Three changes to pMIN.O were made to produce pMIN. 1 (Figure 2 1 ; SEQ 
ID NO:37, also referred to as pMIN-AOS). The mouse ova epitope was removed, the 
position 9 alanine anchor residue (#547) of HBV pol 551-560 was converted to a valine 
which increased the in vitro binding affinity 40-foId, and a translational stop codon was 

20 introduced at the end of the multi-epitope coding sequence. The changes were made by 
amplifying two overlapping fragments and combining them to yield the full length 
product. 

The first reaction used the 5 ' pcDNA vector primer T7 and the primer Min- 
ovaR (nucleotides 247-2 1 8) TGGACAGTGGCACTGCGAGGACG ACGTCAT (SEQ ID 
25 NO: 120). The 3 ' half was amplified with the primers: Min-ovaF (nucleotides 228-257) 
GCTGGGAGTGGGACTGTCCAGGTACGTGGC (SEQ ID NO:121) and Min-StopR 
(nucleotides 390-361) GGTACCTCACACAAAGGGCACCAGCAGGC (SEQ ID 
NO: 122) 

The two firagments were gel purified, mixed, denatured, annealed, and 
30 filled in with five cycles of PGR. The full length fi-agment was amplified with the 
flanking primers T7 and Min-Stop for 25 more cycles. The product was gel purified, 
digested with Nhel and Kpnl and cloned into pcDNA3. 1 for sequencing iand expression. 
The insert fi-om pMin.l was isolated as an Nhel-Kpnl firagment and cloned into pEP2 to 
make pEP2-AOS. 
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EXAMPLE II: Assav for T Helper Cell Activation 

This example shows methods for assaying T helper cell activity. One 
method for assaying T helper cell activity uses spleen cells of an immunized organism. 
5 Briefly, a spleen cell pellet is suspended with 2-3 ml of red blood cell lysis buffer 
containing 8.3 g/Hter ammonium chloride in 0.001 M Tris-HCl, pH 7.5. The cells are 
incubated in lysis buffer for 3-5 min at room temperature with occasional vortexing. An 
excess volume of 50 ml of RIO medium is added to the cells, and the cells are pelleted. 
The cells are resuspended and pelleted one or two more times in R2 medium or RIO 
10 medium. 

The cell pellet is suspended in RIO medium and counted. If the cell 
suspension is aggregated, the aggregates are removed by filtration or by allowing the 
aggregates to settle by gravity. The cell concentration is brought to 10^/ml, and 100 [i\ of 
spleen cells are added to 96 well flat bottom plates. 
15 Dilutions of the appropriate peptide, such as pan DR epitope (SEQ ID 

NO:145), are prepared in RIO medium at 100, 10, 1, 0.1 and 0.01 ^g/ml, and 100 jal of 
peptide are added to duplicate or triplicate wells of spleen cells. The final peptide 
concentration is 50, 5, 0.5, 0.05 and 0.005 fig/ml. Control wells receive 100 ^tl RIO 
medium. 

20 The plates are incubated for 3 days at 37°C. After 3 days, 20 ^il of 

50 }iCi/ml ^H-thymidine is added per well. Cells are incubated for 1 8-24 hours and then 
harvested onto glass fiber filters. The incorporation of ■^H-thymidine into DNA of 
proliferating cells is measured in a beta counter. 

A second assay for T helper cell activity uses peripheral blood 

25 mononuclear cells (PBMC) that are stimulated in vitro as described in Alexander et al, 
supra and Sette (WO 95/07,707), as adapted fi-om Manca et al. J. Immunol 146:1964- 
1971 (1991), which is incorporated herein by reference. Briefly, PBMC are collected 
from healthy donors and purified over FicoU-Plaque (Pharmacia Biotech; Piscataway, 
NJ). PBMC are plated in a 24 well tissue culture plate at 4 x 10^ cells/ml. Peptides are 

30 added at a final concentration of 10 ng/ml. Cultures are incubated at 37°C in 5% CO2. 

On day 4, recombinant interleukin-2 (IL-2) is added at a final 
concentration of 10 ng/ml. Cultures are fed every 3 days by aspirating 1 ml of medium 
and replacing with firesh medium containing IL-2. Two additional stimulations of the T 
cells with antigen are performed on approximately days 14 and 28. The T cells (3 x 
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10^/well) are stimulated with peptide (10 |J.g/ml) using autologous PBMC cells (2x10^ 
irradiated cells/well) (irradiated with 7500 rads) as antigen-presenting cells in a total of 
three wells of a 24 well tissue culture plate. In addition, on day 14 and 28, T cell 
proliferative responses are determined under the following conditions: 2 x lO'^ T 
5 cells/well; 1 x 10^ irradiated PBMC/well as antigen-presenting cells; peptide 

concentration varying between 0.01 and 10 jig/ml final concentration. The proliferation 
of the T cells is measured 3 days later by the addition of ■'H-thymidine (1 |xCi/well) 1 8 hr 
prior to harvesting the cells. Cells are harvested onto glass filters and ■'H-thymidine 
incorporation is measured in a beta plate counter. These results demonstrate methods for 
10 assaying T helper cell activity by measuring ■'H-thymidine incorporation. 

EXAMPLE III: Assav for Cytotoxic T Lvmphocvte Response 

This example shows a method for assaying cytotoxic T lymphocyte (CTL) 
activity. A CTL response is measured essentially as described previously (Vitiello et ai, 

15 Eur. J. Immunol. 27:671-678 (1997), which is incorporated herein by reference). Briefly, 
after approximately 10-35 days following DNA immunization, splenocytes from an 
animal are isolated and co-cultured at 37°C with syngeneic, irradiated (3000 rad) peptide- 
coated LPS blasts (1 x 10^ to 1.5 x 10^ cells/ml) in 10 ml RIO in T25 flasks. LPS blasts 
are obtained by activating splenocytes (1 x 10^ to 1.5 x 10* cells/ml) with 25 jig/ml 

20 lipopolysaccharides (LPS) (Sigma cat. no. L-2387; St. Louis, MO) and 7 lAg/mi dextran 
sulfate (Pharmacia Biotech) in 30 ml RIO medium inT75 flasks for 3 days at 37°C. The 
lymphoblasts are then resuspended at a concentration of 2,5 x 10^ to 3.0 x lO'/ml, 
irradiated (3000 rad), and coated with the appropriate peptides (lOOjig/ml) for 1 h at 
37''C. Cells are washed once, resuspended in RIO medium at the desired concentration 

25 and added to the responder cell preparation. Cultures are assayed for cytolytic activity on 
day 7 in a ^^Cr-release assay. 

For the ^'Cr-release assay, target cells are labeled for 90 min at 37°C with 
150 \il sodium ^'chromate (^'Cr) (New England Nuclear; Wilmington DE), washed three 
times and resuspended at the appropriate concentration in RIO medium. For the assay, 

30 10'* target cells are incubated in the presence of different concentrations of effector cells 
in a final volume of 200 yd in U-bottom 96 well plates in the presence or absence of 10 
|a.g/ml peptide. Supematants are removed after 6 h at 37°C, and the percent specific lysis 
is determined by the formula: percent specific lysis = 100 x (experimental release - 
spontaneous release), (maximum release - spontaneous release). To facilitate comparison 
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of responses from different experiments, the percent release data is transformed to lytic 
units 30 per 10^ cells (LU30/10^), with 1 LU30 defined as the nmnber of effector cells 
required to induce 30% lysis of 10* target ceils in a 6 h assay. LU values represent the 
LU30/10^ obtained in the presence of peptide minus LU30/10* in the absence of peptide. 
5 These results demonstrate methods for assaying CTL activity by measuring ^'Cr release 
from cells. 

EXAMPLE IV: T Cell Proliferation in Mice Immunized with Expression Vectors 

Encoding MHC Class n Epitopes and MHC Class II Targeting Sequences 
10 This example demonstrates that expression vectors encoding MHC class II 

epitopes and MHC class II targeting sequences are effective at activating T cells. 

The constmcts used in the T cell proliferation assay are described in 

Example I and were cloned into the vector pEP2, a CMV driven expression vector. The 

peptides used for T cell in vitro stimulation are: Ova 323-339, ISQAVHAAHAEINEAGR 
15 (SEQ ID NO: 123); HBVcorel28, TPPAYRPPNAPILF (SEQ ID NO: 124); HBVenvl82, 

FFLLTRILTIPQSLD (SEQ ID NO: 125); and PADRE, AKFVAAWTLKAAA (SEQ ID 

NO:38). 

T cell prohferation was assayed essentially as described in Example 11. 
Briefly, 12 to 16 week old B6D2 Fl mice (2 mice per construct) were injected with 100 

20 jxg of the indicated expression vector (50 jig per leg) in the anterior tibialis muscle. After 
eleven days, spleens were collected from the mice and separated into a single cell 
suspension by Bounce horaogenization. The splenocytes were counted and one million 
splenocytes were plated per well in a 96-well plate. Each sample was done in triplicate. 
Ten pg/ml of the corresponding peptide encoded by the respective expression vectors was 

25 added to each well. One well contained splenocytes without peptide added for a negative 
control. Cells were cultured at 37°C, 5% CO2 for three days. 

After three days, one [iCi of ^H-thymidine was added to each well. After 
18 hours at 37°C, the cells were harvested onto glass filters and '^H incorporation was 
measured on an LKB {3 plate counter. The results of the T cell proliferation assay are 

30 shown in Table 9. Antigenspecific T cell prohferation is presented as the stimulation 
index (SI); this is defined as the ratio of the average •'H-thymidine incorporation in the 
presence of antigen divided by the ^H-thymidine incorporation in the absence of antigen. 

The immunogen "PADRE + IFA" is a positive control where the PADRE 
peptide in incomplete Freund's adjuvant was injected into the mice and compared to the 
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response seen by injecting the MHC class 11 epitope constructs containing a PADRE 
sequence. As shown in Table 9, most of the expression vectors tested were effective at 
activating T cell proliferation in response to the addition of PADRE peptide. The activity 
of several of the expression vectors was comparable to that seen with immunization with 
5 the PADRE peptide in incomplete Freund's adjuvant. The expression vectors containing 
both MHC class I and MHC class II epitopes, pEP2-A0S and pcDNA-AOS, were also 
effective at activating T cell proliferation in response to the addition of PADRE peptide. 

These results show that expression vectors encoding MHC class 11 
epitopes fused to a MHC class II targeting sequence is effective at activating T cell 
10 proliferation and are useful for stimulating an immune response. 

EXAMPLE V: In vixo assay Using Transgenic Mice 
A. Materials and methods 

Peptides were synthesized according to standard F-moc solid phase 
1 5 synthesis methods which have been previously described (Ruppert et al. Cell 74:929 
(1993); Sette et al., Mol. Immunol. 31:813 (1994)). Peptide purity was determined by 
analytical reverse-phase HPLC and purity was routinely >95%. Synthesis and 
purification of the Theradigm-HBV lipopeptide vaccine is described in (Vitiello et al, J. 
Clin. Invest. 95:341 (1995)). 

20 

Mice 

HLA-A2.1 transgenic mice used in this study were the Fl generation 
derived by crossing transgenic mice expressing a chimeric gene consisting of the al, a2 
domains of HLA-A2.1 and a3 domain of H-2K'' with SJL/J mice (Jackson Laboratory, 
25 Bar Harbor, ME). This strain will be referred to hereafter as HLA-A2.1/K''-H-2'"". The 
parental HLA-A2,1/K'' transgenic strain was generated on a C57BL/6 background using 
the transgene andmediods described in (Vitiello etal, J. Exp. Med. 173:1007 (1991)). 
HLA-Al l/K*' transgenic mice used in the current study were identical to those described 
in (Alexander et al, J. Immunol. 159:4753 (1997)). 

30 

Cell lines. MHC purification, and peptide binding assay 
Target cells for peptide-specific cytotoxicity assays were Jurkat cells 
transfected with the HLA-A2.1/K'' chimeric gene (Vitiello et al. J. Exp. Med. 173:1007 
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(1 991)) and .221 tumor cells transfected with HLA-Al 1/K.'' (Alexander et al, J. Immunol. 
159:4753 (1997)). 

To measure presentation of endogenously processed epitopes, Jurkat- 
A2.1/K'' ceils were transfected with the pMin.l or pMin.2-GFP minigenes then tested in a 
cytotoxicity assay against epitope-specific CTL lines. For transfection, Jurkat-A2.1/K'' 
cells were resuspended at 10'' cells/ml and 30 \x% of DNA was added to 600 |j,l of cell 
suspension. After electroporating cells in a 0.4 cm cuvette at 0.25 kV, 960 jiFd, cells 
were incubated on ice for 10 min then cultvired for 2 d in RPMI culture medium. Cells 
were then cultured in medium containing 200 U/ml hygromycin B (Calbiochem, San 
Diego CA) to select for stable transfectants. FACS was used to enrich the fraction of 
green fluorescent protein (GFP)-expressing cells fi:om 15% to 60% (data not shown). 

Methods for measuring the quantitative binding of peptides to purified 
HLA-A2.1 and -Al 1 molecules is described in Ruppert et al. Cell 74:929 (1993); Sette et 
al, Mol Immunol 31:813 (1994); Alexander et al. J. Immunol 159:4753 (1997). 

All tumor cell lines and splenic CTLs from primed mice were grown in 
culture medium (CM) that consisted of RPMI 1640 medium with Hepes (Life 
Technologies, Grand Island, NY) supplemented with 10% FBS, 4 mM L-glutamine, 5 X 
10'^ M 2-ME, 0.5 mM sodium pyruvate, 100 pg/ml streptomycin, and 100 U/ml 
penicillin. 

Construction of minigene multi-epitope DNA plasmids 
pMIN.O and pMIN.l (i.e., pMIN-AOS) were constructed as described 
above and in USSN 60/085,75 1 . 



25 pMin.l-No PADRE and pMin.l -Anchor. pMin. 1 was amplified using two 

overlapping fragments which was then combined to yield the full length product. The 
first reaction used the 5' pcDNA vector primer T7 and either primer 
ATCGCTAGGCAGGAACTTATACAGGATTCG (SEQ ED NO:126) for pMin.l-No 
PADRE or TGGACAGTCCGGCTCCCAGCACCACGT (SEQ ID NO: 127) for pMin.l- 

30 Anchor. The 3 ' half was amplified with the primers TTCCTGCCTAGCGATTTC (SEQ 
ID NO:128) (No PADRE) or GCTGGGAGCCGGACTGTCCAGGTACGT (SEQ ID 
NO: 129) (Anchor) and Min-StopR. The two fragments generated from amplifying the 5' 
and 3' ends were gel purified, mixed, denatured, annealed, and filled in with five cycles 
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of PGR. The full length fragment was furtner amplified with the flanking primers T7 and 
Min-StopR for 25 more cycles. 

pMin.l-No Sig. The Ig signal sequence was deleted from pMin.l by PGR 
5 amplification with primer GCTAGCGCCGCCACCATGCACACCCTGTGGAAGGC 
CGGAATG (SEQ ID NO: 130) and pcDNA rev (Invitrogen) primers. The product was 
cloned into pCR-blunt and sequenced. 

pMin.l -Switch. Three overlapping fragments were amplified from 
10 pMin.l, combined, and extended. The 5' fragment was amplified with the vector primer 
T7 and primer GGGCACCAGCAGGCTGAGCCACAGTCCCAGCACCACGTC (SEQ 
ID N0:131). The second overlapping fragment was amplified with primers 
AGCCTGCTGGTGCCCTTTGTGATCCTGAAGGAGCCTGTGG (SEQ ID NO: 132) 
and AGGCACGTACCTGGAGAGTCCCTTCCAGACAGGCACTCCAT (SEQ ID 
15 NO:133). Primer TGTGCAGGTACGTGGCTAGGCTGTGAGGTACC (SEQ ID 

NO: 134) and the vector primer pcDNA rev (Invitrogen) were used to amplify the third 
(3') fragment. Fragments 1, 2, and 3 were ampUfied and gel purified. Fragments 2 and 3 
were mixed, annealed, amplified, and gel purified. Fragment I was combined with the 
product of 2 and 3, and extended, gel purified and cloned into pcDNA3.1 for expression. 

20 

pMin.2-GFP. The signal sequence was deleted from pMin.O by PGR 
amplification with Min.O-No Sig-5' plus pcDNA rev (Invitrogen) primers 
GCTAGCGCCGGGACCATGCACACCCTGTGGAAGGCCGGAATC (SEQ ID 
NO: 135). The product was cloned into pCR-blunt and sequenced. The insert containing 
25 the open reading frame of the signal sequence-deleted multi-epitope construct was cut out 
with Nhel plus Hindm and ligated into the same sites of pEGFPNl (Clontech). This 
construct fuses the coding region of the signal-deleted pMin.O construct to the N-terrainus 
of green fluorescent protein (GFP). 

30 Immunization of mice 

For DNA immunization, mice were pretreated by injecting 50 p.! of 10 pM 
cardiotoxin (Sigma Chem. Go., #09759) bilaterally into the tibialis anterior muscle. Four 
or five days later, 100 pg of DNA diluted in PBS were injected in the same muscle. 
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Theradigm-HBV lipopeptide (10 mg/ml in DMSO) that was stored at - 
20°C, was thawed for 1 0 min at 45°C before being diluted 1:10 (v/v) with room 
temperature PBS. Inunediately upon addition of PBS, the lipopeptide suspension was 
vortexed vigorously and 100 ill was injected s.c. at the tail base (100 p.g/mouse). 
5 Immunogenicity of individual CTL epitopes was tested by mixing each 

CTL epitope (50 ng/mouse) with the HBV core 128-140 peptide (TPPAYRPPNAPIL 
(SEQ ID NO:124), 140 jig/mouse) which served to induce I-A^-restricted Th cells. The 
peptide cocktail was then emuslifed in incomplete Freund's adjuvant (Sigma Chem. Co.) 
and 100 yd of peptide emulsion was injected s.c. at the tail base. 

10 

In vitro CTL cultures and cytotoxicity assays 

Eleven to 14 days after immunization, animals were sacrificed and a single 
cell suspension of splenocytes prepared. Splenocytes from cDNA-primed animals were 
stimulated in vitro with each of the peptide epitopes represented in the minigene. 

1 5 Splenocytes (2.5-3 .0X1 OVflask) were cultured in upright 25 cm^ flasks in the presence 
of 10 ng/ml peptide and I O' irradiated spleen cells that had been activated for 3 days with 
LPS (25 lig/ml) and dextran sulfate (7 ng/ml). Triplicate cultures were stimulated with 
each epitope. Five days later, cultures were fed with fresh CM. After 10 d of in vitro 
culture, 2-4 X 10^ CTLs from each flask were restimulated with 1 0^ LPS/dextran sulfate- 

20 activated splenocytes treated with 100 fxg/ml peptide for 60-75 min at 37°C, then 

irradiated 3500 rads. CTLs were restimulated in 6-welI plates in 8 ml of cytokine-free 
CM. Eighteen hr later, cultures received cj^okines contained in con A-activated 
splenocyte supematant (10-15% final concentration, v/v) and were fed or expanded on the 
third day with CM containing 10-15% cytokine supemate. Five days after restimulation, 

25 CTL activity of each culture was measured by incubating varying numbers of CTLs with 
lO"^ ^'Cr-labelled target cells in the presence or absence of peptide. To decrease 
nonspecific cytotoxicity from NK cells, YAC-1 cells (ATCC) were also added at a YAC- 
1 :^^Cr-labeled target cell ratio of 20: 1 . CTL activity against the HBV Pol 551 epitope 
was measured by stimulating DNA-primed splenocytes in vitro with the native A- 

30 containing peptide and testing for cytotoxic activity against the same peptide. 

To more readily compare responses, the standard E:T ratio vs % 
cytotoxicity data cur\'es were converted into LU per 10^ effector cells with one LU 
defined as the lytic activity required to achieve 30% lysis of target cells at a 100:1 E:T 
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ratio. Specific CTL activity (ALU) was calculated by subtracting the LU value obtained 
in the absence of peptide from the LU value obtained with peptide. A given culture was 
scored positive for CTL induction if all of the following criteria were met: 1) ALU >2; 2) 
LU(+ peptide) + LU(- peptide) > 3; and 3) a >10% difference in % cytotoxicity tested 
5 with and without peptide at the two highest E:T ratios (starting E:T ratios were routinely 
between 25-50:1). 

CTL lines were generated from pMin.l -primed splenocytes through 
repeated weekly stimulations of CTLs with peptide-treated LPS/DxS-activated 
splenocytes using the 6-well culture conditions described above with the exception that 
10 CTLs were expanded in cytokine-containing CM as necessary during the seven day 
stimulation period. 

Cytokine assay 

To measure EFN-y production in response to minigene-transfected target 
15 cells, 4 X lO'^ CTLs were cultured with an equivalent number of minigene-transfected 
Jurkat-A2.iyK'' cells in 96-well flat bottom plates. After overnight incubation at 37°C, 
culture supernatant from each well was collected and assayed for IFN-y concentration 
using a sandwich ELISA. Immulon II microliter wells (Dynatech, Boston, MA) were 
coated overnight at 4°C with 0.2 p,g of anti-mouse IFN-y capture Ab, R4-6A2 
20 (Pharmingen). After washing wells with PBS/0.1% Tween-20 and blocking with 1% 
BSA, Ab-coated wells were incubated with culture supemate samples for 2 hr at room 
temperature. A secondary anti-EFN-y Ab, XMG1.2 (Pharmingen), was added to wells and 
allowed to incubate for 2 hr at room temperatiu-e. Wells were then developed by 
incubations with Avidin-DH and fmally with biotinylated horseradish peroxidase H 
25 (Vectastain ABC kit, Vector Labs, Burlingame, CA) and TMB peroxidase substrate 

(ICirkegaard and Perry Labs, Gaithersberg, MD). The amount of cytokine present in each 
sample was calculated using a rIFN-y standard (Pharmingen). 

6, Results 

30 Selection of epitopes and minieene construct design 

In the first series of experiments, the issue was whether a balanced 
multispecific CTL response could be induced by simple minigene cDNA constructs that 
encode several dominant HLA class I-restricted epitopes. Accordingly, nine CTL 
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epitopes were chosen on the basis of their relevance in CTL immunity during HBV and 
HIV infection in humans, their sequence conservancy among viral subtypes, and their 
class I MHC binding affinity (Table 10). Of these nine epitopes, six are restricted by 
HLA-A2.1 and three showed HLA-Al 1 -restriction. One epitope, HBV Pol 55 1, was 
5 studied in two alternative forms: either the wild type sequence or an analog (HBV Pol 
551-V) engineered for higher binding affinity. 

As referenced in Table 10, several independent laboratories have reported 
that these epitopes are part of the dominant CTL response during HBV or HIV infection. 
All of the epitopes considered showed greater than 75% conservancy in primary amino 

1 0 acid sequence among the different HBV subtypes and HIV clades. The MHC binding 
affinity of the peptides was also considered in selection of the epitopes. These 
experiment addressed the feasibility of immunizing with epitopes possessing a wide range 
of affinities and, as shown in Table 10, the six HBV and three HIV HLA-restricted 
epitopes covered a spectrum of MHC binding affinities spanning over two orders of 

15 magnitude, with IC5o% concentrations ranging firom 3 nM to 200 nM. 

The immunogenicity of the six A2.1- and three Al I -restricted CTL 
epitopes in transgenic mice was verified by co-immunization with a helper T cell peptide 
in an IFA formulation. All of the epitopes induced significant CTL responses in the 5 to 
73 ALU range (Table 10). As mentioned above, to improve the MHC binding and 

20 immunogenicity of HBV Pol 55 1, the C-terminal A residue of this epitope was substituted 
with V resulting in a dramatic 40-fold increase in binding affinity to HLA-A2.1 (Table 
10). While the parental sequence was weakly or nonimmunogenic in HLA transgenic 
mice, the HBV Pol 551-V analog induced significant levels of CTL activity when 
administered in IFA (Table 10). On the basis of these results, the V analog of the HBV 

25 Pol 551 epitope was selected for the initial minigene construct. In all of the experiments 
reported herein, CTL responses were measured with target cells coated with the native 
HBV Pol 551 epitope, irrespective of whether the V analog or native epitope was utihzed 
for immunization. 

Finally, since previous studies indicated that induction of T cell help 

30 significantly improved the magnitude and duration of CTL responses (Vitiello et al, J. 

Clin. Invest. 95:341 (1995); Livingston et al., J. Immunol. 159:1383 (1997)), tiie universal 
Th cell epitope PADRE was also incorporated into the minigene. PADRE has been 
shown previously to have high MHC binding affinity to a wide range of mouse and 
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human MHC class II haplotypes (Alexander et ai, Immunity 1 :75 1 (1 994)). In particular, 
it has been previously shown that PADRE is highly inmiunogenic in H-2'' mice that are 
used in the current study (Alexander et ai, Immunity 1:751 (1994)). 

pMin.l, the prototype cDNA minigene construct encoding nine CTL 
5 epitopes and PADRE, was synthesized and subcloned into the pcDNA3.1 vector. The 
position of each of the nine epitopes in the minigene was optimized to avoid j-unctional 
mouse H-2'' and HLA-A2.1 class I MHC epitopes. The mouse Ig k signal sequence was 
also included at the 5' end of the construct to facilitate processing of the CTL epitopes in 
the endoplasmic reticulum (ER) as reported by others (Anderson et at., J. Exp. Med. 
10 174:489 (1991)). To avoid further conformational structure in the translated polypeptide 
gene product that may affect processing of the CTL epitopes, an ATG stop codon was 
introduced at the 3 ' end of the minigene construct upstream of the coding region for c- 
myc and poly-his epitopes in the pcDNAS.l vector. 

15 Immunogenicity of pMin.l in transgenic mice 

To assess the capacity of the pMin. 1 minigene construct to induce CTLs in 
vivo, HLA-A2.1/K''-H-2'"'^ transgenic mice were immunized intramuscularly with 100 |J.g 
of naked cDNA. As a means of comparing the level of CTLs induced by cDNA 
immunization, a control group of animals was also immunized with Theradigm-HB V, a 

20 palmitolyated lipopeptide consisting of the HBV Core 1 8 CTL epitope linked to the 
tetanus toxin 830-843 Th cell epitope. 

Splenocytes from immunized animals were stimulated twice with each of 
the peptide epitopes encoded in the minigene, then assayed for peptide-specific cytotoxic 
activity in a ^ ' Cr release assay. A representative panel of CTL responses of pMin. 1 - 

25 primed splenocytes, shown in Figure 22, clearly indicates that significant levels of CTL 
induction were generated by minigene immunization. The majority of the cultures 
stimulated with the different epitopes exceeded 50% specific lysis of target cells at an E:T 
ratio of 1 : 1 . The results of four independent experiments, compiled in Table 1 1 , indicate 
that the pMin.l construct is indeed highly immunogenic in HLA-A2.1/K''-H-2^''^ 

30 transgenic mice, inducing a broad CTL response directed against each of its six A2.1- 
restricted epitopes. 

To more conveniently compare levels of CTL induction among the 
different epitopes, the % cytotoxicity values for each splenocyte culture was converted to 
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ALU and the mean ALU of CTL activity in positive cultures for each epitope was 
determined {see Example V, materials and methods, for positive criteria). The data, 
expressed in this manner in Table 1 1, confirms the breadth of CTL induction elicited by 
pMin.l immunization since extremely high CTL responses, ranging between 50 to 700 
5 ALU, were observed against the six A2.1-restricted epitopes. More significantly, the 
responses of several hundred ALU observed for five of the six epitopes approached or 
exceeded that of the Theradigm-HBV lipopeptide, a vaccine formulation known for its 
high CTL-inducing potency (Vitiello et al. J. Clin. Invest. 95:341 (1995); Livingston et 
al, J. Immunol. 159:1383 (1997)). TheHBV Env 335 epitope was the only epitope 
10 showing a lower mean ALU response compared to Upopeptide (Table 1 1, 44 vs 349 
ALU). 

Processing of minigene epitopes bv transfected cells 

The decreased CTL response observed against HBV Env 335 was 

15 somewhat unexpected since this epitope had good A2.1 binding affinity (IC50%, 5 nM) 
and was also immunogenic when administered in IF A. The lower response may be due, 
at least in part, to the inefficient processing of this epitope from the minigene polypeptide 
by antigen presenting cells following in vivo cDNA immunization. To address this 
possibility, Jurkat-A2.1/K'' tumor cells were transfected with pMin.l cDNA and the 

20 presentation of the HB\' Env 335 epitope by transfected cells was compared to more 
immunogenic A2.1 -restricted epitopes using specific CTL lines. Epitope presentation 
was also studied using nimor cells transfected with a control cDNA construct, pMin.2- 
GFP, that encoded a similar multi-epitope minigene flised with GFP which allows 
detection of minigene expression in transfected cells by FACS. 

25 Epitope presentation of the transfected Jurkat cells was analyzed using 

specific CTL lines, with cytotoxicity or IFN-y production serving as a read-out. It was 
foimd that the levels of CTL response correlated directly with the in vivo immunogenicity 
of the epitopes. Highly immunogenic epitopes in vivo, such as HBV Core 18, HIV Pol 
476, and HBV Pol 455, were efficiently presented to CTL lines by pMin.l - or pMin.2- 

30 GFP-transfected cells as measured by IFN-y production (Figure 23 A, >1 00 pg/ml for each 
epitope) or cytotoxic activity (Figure 23C, >30% specific lysis). In contrast to these high 
levels of zn vitro activit>', the stimulation of the HBV Env 335-specific CTL line against 
both populations of transfected cells resulted in less than 12 pg/ml IFN-y and 3% specific 
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lysis. Although the HBV Env 335-specific CTL line did not recognize the naturally 
processed epitope efficiently, this line did show an equivalent response to peptide-loaded 
target cells, as compared to CTL lines specific for the other epitopes (Figure 23B, D). 
Collectively, these results suggest that a processing and/pr presentation defect associated 
5 with the HBV Env 335 epitope that may contribute to its diminished immunogencity in 
vivo. 

Effect of the helper T cell epitope PADRE on minigene immunogenicity 
Having obtained a broad and balanced CTL response in transgenic mice 

1.0 immunized with a minigene cDNA encoding multiple HLA-A2. 1 -restricted epitopes, next 
possible variables were examined that could influence the immunogenicity of the 
prototype construct. This type of analysis could lead to rational and rapid optimization of 
future constructs. More specifically, a cDNA construct based on the pMin.l prototype 
was synthesized in which the PADRE epitope was deleted to examine the contribution of 

15 T cell help in minigene immunogenicity (Figure 24 A). 

The results of the immunogenicity analysis indicated that deletion of the 
PADRE Th cell epitope resulted in significant decreases in the frequency of specific CTL 
precursors against four of the minigene epitopes (HBV Core 18, HIV Env 120, HBV Pol 
455, and HBV Env 335) as indicated by the 17 to 50% CTL-positive cultures observed 

20 against these epitopes compared to the 90-100% frequency in animals immunized with 
the prototype pMin.l construct (Figure 25). Moreover, for two of the epitopes, HBV 
Core 18 and HIV Env 120, the magnitude of response in positive cultures induced by 
pMin.l-No PADRE was 20- to 30-fold less than that of the pMin.l construct (Figure 
25A). 

25 

Effect of modulation of MHC bindine affinity on epitope immunogenicity 
Next a construct was synthesized in which the V anchor residue in HBV 
Pol 551 was replaced with alanine, the native residue, to address the effect of decreasing 
MHC binding on epitope immunogenicity (Figure 24B). 
30 Unlike deletion of the Th cell epitope, decreasing the MHC binding 

capacity of the HBV Pol 551 epitope by 40-fold through modification of the anchor 
residue did not appear to affect epitope immunogenicity (Figure 25B). The CTL response 
against the HBV Pol 551 epitope, as well as to the other epitopes, measured either by LU 
or fi:equency of CTL-positive cultures, was very similar between the constructs 
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containing the native A or improved V residue at the MHC binding anchor site. This 
finding reinforces the notion that minimal epitope minigenes can efficiently dehver 
epitopes of vastly different MHC binding affinities. Furthermore, this finding is 
particularly relevant to enhancing epitope immunogenicity via different delivery methods, 
5 especially in light of the fact that the wild type HBV Pol 551 epitope was essentially 
nonimmimogenic when delivered in a less potent IFA emulsion. 

Effect of the signal sequence on minieene construct immunogenicity 
The signal sequence was deleted from the pMui.l construct, thereby 

10 preventing processing of the minigene polypeptide in the ER (Figure 24C). When the 
immunogenicity of the pMin.l-No Sig construct was examined, an overall decrease in 
response was found against four CTL epitopes. Two of these epitopes, HIV Env 120 and 
HBV Env 335, showed a decrease in frequency of CTL-positive cultures compared to 
pMin.l while the remaining epitopes, HBV Pol 455 and HIV Pol 476, showed a 16-fold 

15 (from 424 to 27 ALU) and 3-fold decrease (709 to 236 ALU) in magnitude of the mean 
CTL response, respectively (Figure 25C). These findings suggest that allowing ER- 
processing of some of the epitopes encoded in the pMin. 1 prototype construct may 
improve immunogenicity, as compared with constructs that allow only cytoplasmic 
processing of the same panel of epitopes. 

20 

Effect of epitope rearrangement and creation of new junctional epitopes 
In the final construct tested, the immunogenicity of the HBV Env 335 
epitope was analyzed to determine whether it may be influenced by its position at the 3 ' 
terminus of the minigene construct (Figure 24D). Thus, the position of the Env epitope in 

25 the cDNA construct was switched with a more immunogenic epitope, HBV Pol 455, 
located in the center of the minigene. It should be noted that this modification also 
created two potentially new epitopes. As shown in Figure 25D, the transposition of the 
two epitopes appeared to affect the immunogenicity of not only the transposed epitopes 
but also more globally of other epitopes. Switching epitopes resulted in obliteration of 

30 CTL induction against HBV Env 335 (no positive cultures detected out of six). The CTL 
response induced by the terminal HBV Pol 455 epitope- was also decreased but only 
slightly (424 vs 78 mean ALU). In addition to the switched epitopes, CTL induction 
. against other epitopes in the pMin.l -Switch construct was also markedly reduced 
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compared to the prototvpe construct. For example, a CTL response was not observed 
against the HIV Env 120 epitope and it was significantly diminished against the HBV 
Core 18 (4 of 6 positive cultvtres, decrease in mean ALU from 306 to 52) and HBV Pol 
476 (decrease in mean ALU from 709 to 20) epitopes (Figure 25D). 
5 As previously mentioned, it should be noted that switching the two 

epitopes had created new junctional epitopes. Indeed, in the pMin.l -Switch construct, 
two new potential CTL epitopes were created from sequences of HBV Env 335-HIV Pol 
476 (LLVPFVIL (SEQ ID NO:135), H-2K''-restricted) and HBV Env 335-HBV Pol 551 
(VLGVWLSLLV (SEQ ED NO:136), HLA-A2.1 -restricted) epitopes. Although these 
10 junctional epitopes have not been examined to determine whether or not they are indeed 
immunogenic, this may account for the low immunogenicity of the HBV Env 335 and 
HIV Pol 476 epitopes. These findings suggest that avoiding junctional epitopes may be 
important in designing multi-epitope minigenes as is the ability to confirm their 
immunogenicity in vivo in a biological assay system such as HLA transgenic mice. 

15 

Induction of CTLs against Al 1 epitopes encoded in pMin.l 
To further examine the flexibility of the minigene vaccine approach for 
inducing a broad CTL response against not only multiple epitopes but also against 
epitopes restricted by different HLA alleles, HLA-Al IIY^ transgenic mice were 

20 immunized to determine whether the three Al 1 epitopes in the pMin.l construct were 
immunogenic for CTLs, as was the case for the A2.1 -restricted epitopes in the same 
constmct. As summarized in Table 12, significant CTL induction was observed in a 
majority of cultures against all three of the HLA-Al 1-restricted epitopes and the level of 
CTL immunity induced for the three epitopes, in the range of 40 to 260 ALU, exceeded 

25 that of peptides delivered in IFA (Table 10). Thus, nine CTL epitopes of varying HLA 
restrictions incorporated into a prototype minigene construct all demonstrated significant 
CTL induction in vivo, confirming that minigene DNA plasmids can serve as means of 
delivering multiple epitopes, of varying HLA restrictions and MHC binding affinities, to 
the immune system in an immunogenic fashion and that appropriate transgenic mouse 

30 strains can be used to measure DNA construct immunogenicity in vivo. 

CTLs were also induced against three Al 1 epitopes in Al l/K*' transgenic 
mice. These responses suggest that minigene delivery of multiple CTL epitopes that 
confers broad population coverage may be possible in humans and that transgenic animals 
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of appropriate haplotypes may be a useful tools in optimizing the in vivo immunogenicity 
of minigene DNA. In addition, animals such as monkeys having conserved HLA 
molecules with cross reactivity to CTL and HTL epitopes recognized by human MHC 
molecules can be used to determine hiiman immunogenicity of HTL and CTL epitopes 
5 {Bertoni et al.. J. Immunol. 1 6 1 ;4447-4455 (1 998)). 

This study represents the fust description of the use of HLA transgenic 
mice to quantitate the in vivo immunogenicity of DNA vaccines, by examining response 
to epitopes restricted by human HLA antigens. In vivo studies are required to address the 
variables crucial for vaccine development, that are not easily evaluated by in vitro assays, 

1 0 such as route of administration, vaccine formulation, tissue biodistribution, and 

involvement of primarv- and secondary Ijraphoid organs. Because of its simplicity and 
flexibility, HLA transgenic mice represent an attractive alternative, at least for initial 
vaccine development studies, compared to more cumbersome and expensive studies in 
higher animal species, such as nonhuman primates. The in vitro presentation studies 

15 described aboye further supports the use of HLA transgenic mice for screening DNA 
constructs containing human epitopes inasmuch as a direct correlation between in vivo 
immunogenicity and in vitro presentation was observed. Finally, strong CTL responses 
were observed against all six A 2.1 restricted viral epitopes and in three Al 1 restricted 
epitopes encoded in the prototype pMin.l construct. For five of the A 2.1 restricted 

20 epitopes, the magnimde of CTL response approximated that observed with the 

lipopeptide, Theradigm-HBV, that previously was shown to induce strong CTL responses 
in humans (Vitiello et al, J. Clin. Invest. 95:341 (1995); Livingston et al, J. Immunol. 
159:1383 (1997)). 
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Table 9. Activation of T Cell Proliferation by Expression 
Vectors Encoding MHC Class II Epitopes Fused to MHC 
Class II Targeting Sequences 

5 

Immunogen Stimulating Peptide^ 

PADRE OVA 323 CORE 128 



peptide CFA^ 


3.0(1.1) 


2.7 (1.2) 


3.2(1.4) 


pEP2.(PAOS).(-) 








pEP2.(A0S).(-) 


5.6(1.8) 






pEP2.(PAOS).(sigTh) 


5.0 (2.9) 




2.6(1.5) 


pEP2.(PA0S).(IgaTh) 


5.6 (2.1) 




3.0 (1.6) 


pEP2.(PA0S).(LampTh) 


3.8(1.7) 




3 


pEP2.(PAOS).(IiTh) 


5.2 (2.0) 


3.2 (1.5) 


3.7(1.5) 


pEP2.(PAOS).(H2M) 


3.3 (1.3) 




2.8 



'Geometric mean of cultures with SI > 2. 
^Proliferative response measured in the lymph node. 
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Table 10 
CTL Epitopes in cDNA Minigene 



Immunogenicity In Vivo (IF A) 



MHC 
Binding 



No. CTL- 
Positive 
Culture 









[IC30% (nM) 




ALU 


HBV Core 18 


FLPSDFFPSV 


A2.1 


3 


6/6 


73.0(1.1) 


HBV Env335 


WLSLLVPFV 


A2.1 




4/6 


5.3(1.6) 


HBV Pol 455 


GLSRYVARL 


A2.1 


76 


ND" 


ND 


HIVEnvI20 


KLTPLCVTL 


A2.1 


102 


2/5 


6.4(1.3) 


HIV Pol 476 


ILKEPVHGV 


A2.1 


192 


2/5 


15.2(2.9) 


HBV Pol55I-A 


YMDDWLGA 


A2.1 


200 


0/6 




HBV Pol551-V 


YMDDWLGV 


A2.1 


5 


6/6 


8.2 (2.3) 


HIVEnv49 


TVYYGVPVWK 


All 


4 


28/33 


13.4 (3.1) 


HBV Core 141 


STLPETTWRR 


All 


4 


6/6 


12.1 (2.6) 


HBV Pol 149 


HTLWKAGILYK 


All 


14 


6/6 


13.1 (1.2) 



a Peptide tested in HLA-A2. l/K*" H-2 transgenic mice by co-immunizing with a T helper cell peptide in IFA. 
5 b Geometric mean CTL response of positive cultures, 
c ND, not done. 
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Table 11 

Summary of Immunogenicity of pMin.l DNA 
construct in HLA Al.l/K*^ transgenic mice 



CTL Response 



Epitope No. Positive Geo. Mean Response Positive 
Cultures/Total Cultures [x/-i-SD] 

ALU 

HBVCorelS 9/9 455.5 [2.2] 

HIVEnvl20 - 12/12 211.9 [3.7] 

HBVPoi551-V 9/9 126.1 [2.8] 

HBVP0I455 12/12 738.6 [1.3] 

HIV Pol 476 11/11 716.7 [1.5] 

HBVEnv335 12/12 43.7 [1.8] 

HBVCore IS 10/ 10 349.3 [1.8] 
(Theradigm)" 

' Mice were immunized with plVIin.l DNA or Theradigm-HBV lipopeptide and CTL 
activity in splenoc^-te cultures was determined after in vitro stimulation with 
5 individual peptide epitopes. Results from four independent experiments are shown. 

See Example V, Materials and Methods for definition of a CTL-positive culture. 

Response of mice immunized with Theradigm-HBV lipopeptide containing the HBV 
Core 18 epitope. 
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Table 12 
Summary of immunogenicity 
in HLA Al 1/K'' transgenic mice 



CTL Response^ 



Epitope 


No. Positive 
Cultures/Total'' 


Geo. Mean Response 
Positive Cultures fx/-^- SDl 






ALU 


HBV Core 141 


5/9 


128.1 [1.6] 


HBV Pol 149 


6/9 


267.1 [2.2] 


HIV Env 43 


9/9 


40.1 [2.9] 



" Mice were immunized with pMin. 1 DNA and CTL activity in splenoc>te cultures was 
determined after in vitro stimulation with individual Al 1-restricted epitopes. The 
5 geometric mean CTL response from three independent experiments are shown. 

Definition of a CTL -positive culture is described in Example V, Materials and 
Methods. 
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WHAT IS CLAIMED IS: 

1 1 . An expression vector comprising a promoter operably linked to a 

2 first nucleotide sequence encoding a major histocompatibility (MHC) targeting sequence 

3 fused to a second nucleotide sequence encoding two or more heterologous peptide 

4 epitopes, wherein the heterologous peptide epitopes comprise two HTL peptide epitopes 

5 or a CTL peptide epitope and a universal HTL peptide epitope. 

1 2. The expression vector of claim 1, wherein the heterologous peptide 

2 epitopes comprise two or more heterologous HTL peptide epitopes. 

1 3 . The expression vector of claim 1 , wherein the heterologous peptide 

2 epitopes comprise a CTL peptide epitope and a universal HTL peptide epitope. 

1 4. The expression vector of claim 2, wherein the heterologous peptide 

2 epitopes further comprise one or more CTL peptide epitopes. 

1 5. The expression vector of claim 3, wherein the heterologous peptide 

2 epitopes further comprise two or more CTL peptide epitopes. 

1 6. The expression vector ofclaim 3, wherein the heterologous peptide 

2 epitopes further comprise two or more HTL peptide epitopes. 

1 7. The expression vector of claim 2, wherein one of the HTL peptide 

2 epitopes is a universal HTL epitope. 

1 8. The expression vector ofclaim 3 or 7, wherein the universal HTL 

2 epitope is a pan DR epitope. 

1 9. The expression vector of claim 8, wherein the pan DR epitope has 

2 the sequence AlaLysPheVal AlaAlaTrpThrLeuLysAlaAlaAla (SEQ ID N0:3 8). 

1 10. The expression vector of claim 1, wherein the peptide epitopes are 

2 hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 
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1 II. The expression vector of claim 10, wherein the peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 Tables 1-8. 

1 12. The expression vector of claim 1 1 , wherein at least one of the 

2 peptide epitopes is an analog of a peptide depicted in Tables 1-8. 

1 13. The expression vector of claim 1 , wherein the MHC targeting 

2 sequence comprises a region of a polypeptide selected from the group consisting of the li 

3 protein, LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B 

4 surface antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, lg-|3 protein, and 

5 Ig kappa chain signal sequence, 

1 14. The expression vector of claim 1 , wherein the expression vector 

2 further comprises a second promoter sequence operably linked to a third nucleotide 

3 sequence encoding one or more heterologous HTL or CTL peptide epitopes. 

1 15. The expression vector of claim 1 , wherein the vector comprises 

2 pMinlorpEP2. 

1 16. The expression vector of claim 3 or 4, wherein the CTL peptide 

2 epitope comprises a structural motif for an HLA supertype, whereby the peptide CTL 

3 epitope binds to two or more members of the supertype with an affinity of greater that 

4 500 nM. 

1 1 7. The expression vector of claim 4 or 5 , wherein the CTL peptide 

2 epitopes have structural motifs that provide binding affinity for more than one HLA allele 

3 supertype. 

1 1 8, A method of inducing an immune response in vivo comprising 

2 administering to a mammalian subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding two or more 

5 heterologous peptide epitopes, wherein the heterologous peptide epitopes comprise two 

6 HTL peptide epitopes or a CTL peptide epitope and a universal HTL peptide epitope. 
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1 19. The method of claim 1 8, wherein the heterologous peptide epitopes 

2 comprise two or more heterologous HTL peptide epitopes. 

1 20. The method of claim 1 8, wherein the heterologous peptide epitopes 

2 comprise a CTL peptide epitope and a universal HTL peptide epitope. 

1 21. The method of claim 1 9, wherein the heterologous peptide epitopes 

2 further comprise one or more CTL peptide epitopes. 

1 22. The method of claim 20, wherein the heterologous peptide epitopes 

2 flirther comprise two or more CTL peptide epitopes. 

1 23 . The method of claim 20, wherein the heterologous peptide epitopes 

2 further comprise two or more HTL peptide epitopes. 

1 24. The method of claim 1 9, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 25 . The method of claim 20 or 24, wherein the universal HTL epitope 

2 is a pan DR epitope. 

1 26- The method of claim 25, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAkTrpThrLeuLysAlaAlaAla (SEQ ID NO:38). 

1 27. The method of claim 1 8, wherein the peptide epitopes are hepatitis 

2 B virus epitopes, hepatitis C virus epitopes, hiraian immunodeficiency virus epitopes, 

3 human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PAP epitopes, PSM 

4 epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 28. The method of claim 27, wherein the peptide epitopes each have a 

2 sequence selected from the group consisting of the peptides depicted in Tables 1-8. 

1 29. The method of claim 28, wherein least one of the peptide epitopes 

2 is an analog of a peptide depicted in Tables 1-8. 

1 30. The method of claim 1 8, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 
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4 antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-P protein, and Ig 

5 kappa chain signal sequence. 

1 31. The method of claim 18, wherein the expression vector fiirther 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous HTL or CTL peptide epitopes. 

1 32 . The method of claim 1 8 , wherein the vector comprises pMin. 1 or 

2 pEP2. 

1 33. The method of claim 20 or 21 , wherein the CTL peptide epitope 

2 comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to 

3 two or more members of the supertype with an affinity of greater that 500 nM. 

1 34. The method of claim 21 or 22, wherein the CTL peptide epitopes 

2 have structural motifs that provide binding affinity for more than one HLA allele 

3 supertype. 

1 35. A method of inducing an immune response mvfvo comprising 

2 administering to a mammalian subject an expression vector comprising a promoter 

3 operably linked to a first nucleotide sequence encoding a major histocompatibility (MHC) 

4 targeting sequence fused to a second nucleotide sequence encoding a heterologous human 

5 HTL peptide epitope. 

1 36. The method of claim 35, wherein the second nucleotide sequence 

2 further comprises two or more heterologous HTL peptide epitopes. 

1 37. The method of claim 35, wherein the second nucleotide sequence 

2 ftuther comprises one or more heterologous CTL peptide epitopes. 

1 38. The method ofclaim 35, wherein the HTL peptide epitope is a 

2 universal HTL peptide epitope 

1 39. The method ofclaim 38, wherein the universal HTL epitope is a 

2 pan DR epitope. 

1 40. The method of claim 3 9, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAIaTrpThrLeuLysAlaAIaAla (SEQ ID N0:3 8). 
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1 41 . The method of claim 37, wherein the HTL and CTL peptide 

2 epitopes are hepatitis B virus epitopes, hepatitis C virus epitopes, human 

3 immunodeficiency virus epitopes, human papilloma virus epitopes, MAGE epitopes, PSA 

4 epitopes, PAP epitopes, PSM epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, 

5 or Plasmodium epitopes. 

1 42. The method of claim 41, wherein the peptide epitopes each have a 

2 sequence selected from the group consisting of the peptides depicted in Tables 1-8. 

1 43. The method ofclaim 42, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted in Tables 1 -8. 

1 44. The method of claim 35, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 

3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza matrix protein, hepatitis B surface 

4 antigen, hepatitis B virus core antigen, Ty particle, Ig-a protein, Ig-p protein, and Ig 

5 kappa chain signal sequence. 

1 45 . The method of claim 35, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous HTL or CTL peptide epitopes. 

1 46. The method of claim 37, wherein the CTL peptide epitope 

2 comprises a structural motif for an HLA supertype, whereby the peptide epitope binds to 

3 two or more members of the supertype with an affinity of greater that 500 nM. 

1 47. The method ofclaim 37, wherein the CTL peptide epitopes have 

2 strucmral motifs that provide binding affmity for more than one HLA allele supertype. 

1 48. A method of assaying the human immunogenicity of a human T 

2 cell peptide epitope in vivo in a non-human mammal, comprising the step of 

3 administering to the non-hxmaan mammal an expression vector comprising a promoter 

4 operably linked to a first nucleotide sequence encoding a heterologous human CTL or 

5 HTL peptide epitope. 
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1 49. The method of claim 48, wherein the first nucleotide sequence 

2 encodes two or more heterologous CTL or HTL peptide epitopes. 

1 50 . The method of claim 48, wherein the non-human mammal is a 

2 transgenic mouse that expresses a human HLA allele. 

1 51. The method of claim 50, wherein the human HLA allele is selected 

2 from the group consisting of Al 1 and A2. 1 . 

1 52. The method of claim 48, wherein the expression vector further 

2 comprise a second nucleotide sequence encoding a major histocompatiblity (MHC) 

3 targeting sequence. 

1 53. The method ofclaim 48, wherein the HTL peptide epitope is a 

2 universal HTL epitope. 

1 54. The method ofclaim 53, wherein the universal HTL epitope is a 

2 pan DR epitope. 

1 55. The method of claim 54, wherein the pan DR epitope has the 

2 sequence AlaLysPheValAlaAlaTrpThxLeuLysAlaAlaAla (SEQ ID NO:38). 

1 56. The method of claim 48, wherein the CTL or HTL peptide epitopes 

2 are hepatitis B virus epitopes, hepatitis C virus epitopes, human immunodeficiency virus 

3 epitopes, human papilloma virus epitopes, MAGE epitopes, PSA epitopes, PSM epitopes, 

4 PAP epitopes, p53 epitopes, CEA epitopes, Her2/neu epitopes, or Plasmodium epitopes. 

1 57. The method of claim 56, wherein the CTL or HTL peptide epitopes 

2 each have a sequence selected from the group consisting of the peptides depicted in 

3 . Tables 1-8. 

1 58. The method of claim 57, wherein at least one of the peptide 

2 epitopes is an analog of a peptide depicted ia Tables 1-8. 

1 59. The method ofclaim 52, wherein the MHC targeting sequence 

2 comprises a region of a polypeptide selected from the group consisting of the li protein, 
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3 LAMP-I, HLS-DM, HLA-DO, H2-D0, influenza, hepatitis B virus core antigen, Ty 

4 particle, Ig-a protein, Ig-P protein, and Ig kappa chain signal sequence. 

1 60. The method ofclaim 48, wherein the expression vector further 

2 comprises a second promoter sequence operably linked to a third nucleotide sequence 

3 encoding one or more heterologous human CTL or HTL peptide epitopes. 

1 61. The method of claim 48, wherein the vector comprises pMin. 1 or 

2 pEP2. 

1 62. The method of claim 48, wherein the CTL peptide epitope has a 

2 structural motif that provides binding affinity for an HLA allele supertype. 

1 63 . The method of claim 49, wherein the CTL peptide epitopes have 

2 structural motifs that provide binding affinity for more than one HLA allele supertype. 

1 64. The method of claim 48, wherein the expression vector comprises 

2 both HTL and CTL peptide epitopes. 
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GCTAGCGCCGCCACCATGGATGACCAACGCGACCTCATCTCTAACCATGAGCAATXGCCCATACTGGGCA 
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGA.TTGGTACTCGTTAACGGGTATGACCCGT 
MDI3QRDLrSNHEQLPILG> 



ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTC 
TGGCGGGATCTCTCGGTCriTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA 
NRPREPSRCSRGALYTGVS VIi'vAL> 



GCTCTTGGCrGGGCAGGCCy^CCACTGCTTACrTCCTCTACCAGCAACAGGGCCGCCTAGAC^ 
CGAGAACCGACCCGTCCGGTGG-rGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
LlaA GQATTAYFLYQQQG RI,DKLT> 



ATCACCTCCCAGAACCTGCaACl^GAGAGCCITCGCATGAAGCTTCCGAAATCTGCCAAACCTGTGGCCA 
TAGTGGAGGGTCn^GGACGTTGACCTCTCGGAAGCGTACITCaAAGGCTTTAGACGGTTTGaACACCGGT 
ITSQNLQIiSSLRMKl.PKSAKPVA> 



AGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTATGTCCATGGATAACATGCTCCTTGGGCCTGTGAA 
TCAAGCACCGACGGACCTGGGACTTCCGACGGCGATACAGGTACCTATrGTACGAGGAACCCGGACACTT 
KFVAAWTLKAAAMSMD NML LG PVK> 



GAACGTTACCAAGTACGGCAACATGACCCAGGACCATGTGATGCATCTGCTCACGAGGTCTGGACCCCTG 
CTTGCAATGGTTCATGCCGTTGTACTGGGTCCTGGTACACTACGTAGACGAGTGCTCCAGACCTGGGGAC 
NVTKYGNMTQDHVMHLLTRSGPL> 



GAGTACCCGCAGCTGAAGGGGACCTTCCCAGAGAATCTGAAGCATCTTAAGAACTCCATGGATGGCGTGA 
CTCATGGGCGTCGAerrrCCCCrrGGAAGGGTCTCTTAGACTTCGTAGAATTCrTGAGGTACCTACCGCACT 
EyPQ.LKG7FPENLKHLKNSMDGV> 



ACEGGAAGATCTTCGAGAGCTGGATGAAGCAGTGGCrCTTGTTTGAGATGAGCAAGAACTCCCTGGAGGA 
TGACCTTCTAGAAGCrCTCGACCTACTTCGTCa^CCGAGAACAAACTCrACTCGTTCTTGAGGGACCTCCT 
NWKI?ESHMKQWI.LFEMSKNSliEE> 



GAAGAAGCCCACCGAGGCTCCACCn:AAAGAGC(a.Cr6GAaVTGGAAGACCTATCrrCTGGCCTGGGAGTG 
CTTCTTCGGGTGGCTCCGAGGTGGATTTCTCGGTGACCTGTACCTTCTGGATAGAAGACCGGACCCTCAC 
KKPTEAP?KEPLDMEDIiSSGLGV> 



ACCAGGCAGGAACTGGGTCAAGTCACCCTGTGAGGTACC 
TGGTCCGTCCTTGACCCAGTTCAGTGGGACACTCCATCM 
TRQELGQVTL*> 

FIGURE 1 
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GCTAGCGCCGCCACC^TGGATGACCflACGCGACCTCATCTCTAACCATGAGCAATTGCCCATACTC3GGCA 
CGATCGCGGCGGTGGTACCTACTGGTTGCGCTGGAGTAGAGATTGGTACTCGTTAACGGGTATGACCCGT 
MDDQRDLISNHEQIiPILG> 



ACCGCCCTAGAGAGCCAGAAAGGTGCAGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCTCT 
TGGCGGGATCTCTCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA 
NRPREPERCSRGALYTGVSVLVAL> 



GCTCTTGGCTGGGCAGGCCa.CCACTGCTTACTTCCrGTACCAGCAACAGGGCCGCCTAGACA^ 
CGAGAACCGACCCGTCCGGTGGTGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
LLAGQATTAYFLYQQQGRLDKLT> 



ATCACCTCCCAGAACCTGC^CTGGAGAGCCTTCGCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTC 
TAGTGGAGGGTCTTGGACGTTGACCTCTCGGAAGCGTACTTCGAATAGTCGGTCCGACACGTGCGGCGAG 
ITSQNrQLESLRMKLISQAVHAA> 



ACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTT 
TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAA 
KAEINEAGRTPPAYRPPNAP ILFF> 



TCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG 
AGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAGCACCGACGGACCTGGGACTTC 
LLTR ILTIPQSLDAKFVAAWTLK> 



GCTGCCGCTTGAGGTACC 
CGACGGCGAACTCCATGG 
A A A *> 



FIGURE 2 
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GCTAGCGCCGCCACCyiTGGATGACCflACGCGACCTCarCTCTAACCaTGAGCAATTGCCCaTACTGGGCA. 
CGATCGCGGCGGTGGTACCTACrrGGTTGCGCTGGR.GTAGAGATTGGTACTCGTTAACGGGTATGACCCGT 
MDDQRDLISNHEQLiPXLO 



ACCGCCCTAGAGAGCCAGAAAGGTGaVGCCGTGGAGCTCTGTACACCGGTGTTTCTGTCCTGGTGGCTCT 
TGGCGGGATCTCTCGGTCTTTCCACGTCGGCACCTCGAGACATGTGGCCACAAAGACAGGACCACCGAGA 
NRPR EPERCSRGALYTGVSVLVAL> 



GCTCTTGGCTGGGCAGGCCACCACTGCTTACTTCCTGTACaiGCAACAGGGCCGCCTAGACAAGCT 
CGAGAACCGACCCGTCCGGTGGTGACGAATGAAGGACATGGTCGTTGTCCCGGCGGATCTGTTCGACTGG 
I.LAGQATrAYFLYQQQGRLDKLT> 



ATCACCTCCC»GAACCTGCAACTGGAGAGCCTTCQCATGAAGCTTATCAGCCAGGCTGTGCACGCCGCTC 
TAGTGGAGGGTCTTGGACGTTGACCTCTCGGAAGCGTACTrCQAATAGTCGGTCCGACACGTGCGGCGAG. 
ITSQNLQLESLRMKl4lSQAVHAA> 



ACGCCGAAATCaACGAAGCTGGAAGAACCCCTCCJ^GCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTT 
TGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGGTTTGCGAGGATAGGACAAGAA 
HAEINEAGRTPPAYRPPNAPILFF> 



TCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAG 
AGACGACTGGTGTTAGGACTGTTAGC 

LLTRILTZ PQSLDAK 



GCTGCCGCTATGTCOVTGGATAACATGCTCCTTGGGCCTGTGAAGAACGTTACCAAGTACGGCAACATGA 
CGACCGCGATACAGGTACCTATTGTACGAGGAACCCGGAavCTTCTTGCAATGGTTCATGCCGTTGTACr 
AAAM.SMDKMLIiGPVKNVTKYGNM> 



CCCAGGACCATGTGATGCATCTGCTCACGAGGTCTGGACCCCTGGAGTACCCGCAGCTGAAGGGGACCTT 
GGGTCCTGGTACACTACGTAGACGAGTGCTCCAGACCTGGGGACCTCATGGGCGTCGACTTCCCCTGGAA 
TQDHVMHLLTRSGPLEYP QLKGTF> 



CCCaGAGAATCTGAAGCATCTTAAGAACTCCATGGATGGCGTGAACTGGAAGATCTTCGAGAGCTGGATG 
GGGTCrCTrAGACTTCGTAGAArrCTTGAGGTACCTACCGCACTTGACCTTCTAGAAGCTCTCGACCTAC 
PENLKHL?:NSMDGVKWKrFESWM> 



FIGXJRE 3 
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640 S30 660 570 680 690 700 

AAGCAGTGGCTCrrXGTTTGAGATGAGCAAGAACTCCCTGGAGGAGAAGAAGCCCACCGAGGCTCCACCTA 
TTCGTCACCGAGAACaj«.CTCTACTCGrrcriTGAGGGACCTCCTCTTCTTCGGGTGGCn:CCGAGGT(^ 
KQWLLFHMSKNSLEEKK"PTEAPP> 

710 720 730 • 740 750 760 770 

AAGAGCCACTGGACATGGAAGACCTATCTTCTGGCCTGGGAGTGACCAGGCAGGAACTGGGTCAAGTCAC 
TTCTCGGTGACCTGTACCTTCTGGATAGAAGACCGGACCCTCACTGGTCCGTCCTTGACCCAGTTCAGTG 
KEPLDMEDIiSSGLGVTRQEI>GQVT> 

780 

CCTGTGAGGTACC 
GGACACTCCATGG 
L *> 



FIGURE 3 CONTINUED 
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GCTAGCGCCGCCACCATGGGAATGCAGGTGCAGATCCAGAGCCTGTTTCTGCTCCTCCTGTGC3GTGCCCG 
CGATCGCGGCGGTGGTACCCTTACGTCCACGTCTAGGTCTCGGACAAAGACGAGGAGGACACCCACGGGC 
MGMQVQIQSLFLLI,I,WVP> 



GGTCCAGAGGAATCAGCCAGGCTGTGOICGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCCCTCC 
CCAGGTCTCCTTAGTCGGTCCGACACGTGCGGCGAGTGCGGCTTTAGTTGCTTCGACCrrCTTG<JGGAGG 
GSRG I SQAVHAAHAEINE AGRTPP> 



AGCTTATCGCCCTCCAAACGCTCCTATCCTGTTCTTTCTGCTGACCAGAATCCTGACAATCCCCCAGTCC 
TCGAATAGCGGGAGGTTTGCGAGGATAGGACAR.GAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCAGG 
AYRPPNAPILFFLLTRILTIPQSa 



CTGGACGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTAACAACATGTTGATCCCCATTGCTG 
GACCrrGCGGTTCAAGCACCGACGGACCTGGGACTTCCGACGGCGATTGTTGTACAACTAGGGGTAACGAC 
LDAKFVAAWTLKAAANNMLIPIA> 



TGGGCGGTGCCCTGGCAGGGCTGGTCCTCATCGTCCTCATTGCCTACCTCATTGGCAGGAAGAGGAGTCA 
ACCCGCCACGGGACCGTCCCGACCAGGAGTAGCAGGAGTAACGGATGGAGTAACCGTCCTTCTCCTGAGT 
VGGALAGI.VLIV L lAYL X GRKRSH> 



CGCCGGCTATCAGACCATCTAGGGTACC 
GCGGCCGATAGTCTGGTAGATCCCATGG 
A G Y Q T I *> 



FIGITOE 4 
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GCTAGCGCCGCCACCATGGCTGCACTCT(K3CTGCTGCTGCTGGTCCTCAGTCTGCACTGTATGGGGATCA. 
CGATCGCGGCGGTGGTACCGACGTGAGACCGACGACGACGACCAGGAGTCAGACGTGACATACCCCTAGT 
MAALWLLLLVIiS LHCMGI> 



GCCAGGCTGTGCACGCCGCTCACGCCGAAATCAACGAAGCTGGAAGAACCCCTCCAGGTTATCGCCCTCC 
CGGTCCGACACGTGCGGCGAGTGCGGCTTTAGTTGCTTCGACCTTCTTGGGGAGGTCGAATAGCGGGAGG 
SQAVHAAHAEINEAGRTP PAYRPP> 



AAACGCTCCTATCCTGTTCTTTCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTGGACGCCAAGTTC 
TTTGCGAGGATAGGACAAGAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGACCTGCGGTTCAAG 
NAPILFFLLTRILTXPQSLDAKF> 



K3CTTCA 

CACCGACGGACCTGGGACTTCCGACGGCGATTCCAGAGACACAGACGTCGGTGGGACCCGGACCCGAAGT 
VAAWTLKAAAKVSVSAATt.GLGF> 



TCATCTTCTGTGTTGGCTTCTTCAGATGGCGCAAGTCTCATTCCTCCAGCTACACTCCTCTCCCTGGATC 
AGTAGAAGACACAACCGAAGAAGTCTACCGCGTTCAGAGTAAGGAGGTCGATGTGAGGAGAGGGACCTAG 
IIFCVG?FRWRKSHSSSyTp 'LPGS> 



CACCTACCCAGAAGGACGGCATTAGGGTACC 
GTGGATGGGTCTTCCTGCCGTAATCCCATGG 
TYPEGRH*> 



FIGURE S 
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GCTAGCGCCGCCACCATGGGCGCTGGGAGGGCCCCCTGGGTGGTCMCTCTGTTGGTGAACCrCR.TGAGGC 
CGATCGCGGCGGTGGTACCCGCGACCCTCCCGGGGGACCCACCACCGAGACARCCACTTGGAGTACTCCG 
MGAGRAPWVVALLVNLMR> 



tggattccatcagccaggctgtgc:acgccgctcacgccgaaatcaa.cgaagctggaagaacccctccagc 
accraaggtagtcggtccgacacgtgcggcgagtgcggctttagttgcttcgacctrcttggggaggtcg 

LDSISQAVHAAHAEINEAGRTPP A> 



TrATCGCCCTCCAAACGC^CCTATCCTGTTCTTrCTGCTGACCAGAATCCTGACAATCCCCCAGTCCCTG 
AATAGCGGGAGGTTTGCGAGGATAGGACRAGAAAGACGACTGGTCTTAGGACTGTTAGGGGGTCAGGGAC 
YRPPNAPILFFLLTRI LTIPQSL> 



GACGCCAAGTTCGTGGCTGCCTGGACCCTGAAGGCTGCCGCTATACTGAGTGGAGCTGCAGTGTTCCTGC 
CTGCGGTTCAAGCACCGACGGACCTGGGACTTCCGACGGCGATATGACTCACCTCGACGTCACAAGGACG 
DAKFVAAWTLKAAAILS GAAVFL> 



TTGGGCTGATTGTCTTCCTGGTGGGGGTTGTTATCCATCTCAAGGCTCAGAAAGCATCTGTGGAGACTCA 

aacccgactaacagaagaaccacccccaacaataggtagagttccgagtctttcgtagacacctctgagt 
lglivflvgvvihlkaq KASVETQ> 



GCCTGGCAATGAGAGTAGGTCCCGGATGATGGAGCGGCTAACCAAGTTCAAGGCTGGACCGGGACATGTC 
CGGACCGTTACTCTCATCCAGGGCCTACTACCTCGCCGATTGGTTCAAGTTCCGACCTGGCCCTGTACAG 
PGN-ESRSRMMERLTKFKAGPGEV> 



ACATGAGGTACC 
TGTACTCCATGG 



FIGURE 6 
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GCTAGCGCCGCCACCATGGCCAAi^TTCGTaSCrGCCTGGACCCTGAAGGCTGCCGCTATGAGTCTTCT^ 
CGATCGCGGCGGTGGTACCGGrrcaAGCailCCGACGGACCTGGGACTTCCGACGGCGATRCTavGAAGATT 
MA-CFVAAWTI<KAAAMSLL> 



CCGAGGTCGAAACGTACGrTCTCrCTATCATCCCATCAGGCCCCCTCAAAGCCGAGATCGCGCAQAGACT 
GGCrCCRGCTTrGCATGCAAGAaAGATAGTAGGGTAGTCCGGGGGAGTTTCGGCTCTAGCGCGTCTCT^ 
TEVETYVI.SIIPSGPLKAHIAQRL> 



TGAGGATGTTTTTGCAGGGAAGAiCACAGATCTTGAGGCTCTCATGGAATGGCTAAAGAaVAGACC^ 

rrGTCTAGAACTCCGAGAGTACCXTACCGATTTCTGTTCTGGTTAG 
TDLEALMEWLKTRP I> 



CTGTCACCTCTGACTAAGGGAArTTTAGGGTrrGTGTrCACGCTCACCGTGCCCAGTGAGCGAGGACTGC 
GACAGTGGAGACTGATTCCCTTAAAATCCCAAACACAAGTGCGAGTGGCACGGGTCACTCGCTCCTGACG 
LSPLTKCrL.GFVFTLTVPSERGIo 



AGCGTAGACGATTTGTCCAAAAraCCCTAAATGGGAATGGAGACCCAAACAACATGGACAGGGCAGTTAA 
TCGCATCTGCTAAACAGGTTTTACGGGATTTACCCTTACCTCTGGGTTTGTTGTACCTGTCCCGTCAATO 
QRRRFVQKALNGNGDPNNMDRAVK^ 



ACTATACAAGAAGCTGAAGAGGGAAATGACATTCCATGGAGCAAAGGAAGTTGCACTCAGTTACrCAACT 
TGATATGTTCTTCGACTTCTCCCTTTACTGTAAGGTACCTCGTTTCCTTCAACGTGAGTCAA-rGAGTTGA 



L Y K K L K R 



T F H G 



KEVALSYS 



GGTGCGCTTGCCAGTTGCATGGGCCTCATATACAACCGGATGGGAACAGTGACCACAGAAGTGGCTCrrG 
CCACGCGAACGGTCAACGTACCCAGAGTATATGTrGGCCrACCCTTGTCACTGGTGTCTTCACCGAGAAG 
GALA-SCMGLrYNRMGTVTXEVAL> 



GCCTAGTATGTGCCACTTGTGAC-CAGATTGCTGATGCCCAACATCGGTCCCACAGGCAGATGGCGACTAC 
CGGATCATACACGGTGAACACTCGTCTAACGACTACGGGTTGTAGCCAGGGTGTCCGTCTACCGCTGATG 
GLV CATCE<2IADAQHRSHRQMATT> 



CACCAACCOVCTAATCAGGCATGAGAACAGAATGGTACTAGCCAGCACTACGGCTAAGGC^ 
GTGGTTGGGTGATTAGTCCGTACTC 



V I, A S 



T T 
680 



K A 
590 



ATGGCTGGATCAAGTGAGCAGGCAGCAGAGGCCATGGAAGTCGCMGTCRGGCTAGACAAATGGTGCAGG 
TACCGACCTAGTTCACTCGTCtxrrCGTCTCCGGTACCTTCAGCGTTCAGTCCGATCnXSTTrACa^COT^ 
MAGSSEQAAEAMEVASQARQMVQ? 
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720 730 740 7S0 



CAATGAGGAaUVTTGGGACTCACCCTAGCTGCAGTGaWSOTCTAAAAGATGATCTTATTGAAAA^ 

GTTACTCCTGrrAAGCCTGAGTGGGATCGAGGTCACGTCCAGATTTTCTACTAGAATAACTTTTAAACGT 

AMaTIGTHPSSSAGLKDDLIENLQ> 

730 790 800 SXO 



GGCTTACCAGAAACGGATGGGGGTGCAGATGCAGCGATTCAAGTGA 
CCGAATGGTCTTTGCCTACCCCCACGTCTACGTCGCTAAGTTCACT 
AYQKRMGVQMQRFK*> 



ETGORE 7 CONTINOED 
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GCTAGCGCCGCCACCATGGCCAAGrrCGTGGCTGCCTGGACCCTGAR.GGCTGCCGCTCTCGAGATTGGG<3 
CGATCGCGGCGGTGGTACCGGTTCAAGCACCGACGGACCTGGGACTTCCGACGGCGAGAGCTCTAACCCC 
MAKFVAAWTLKAAALEIG> 



GACCCrGCCTGAACGCCGAGAACATCAC».TCAGGATTCCTAGGACCCCTrCTCCTGTTACAGGCGGGGTT 
CTGGGACGGACTTGCGGCTCTTGTAGTGTAGTCCTAAGGATCCTGGGGAAGAGCACAATGTCCGCCCCAA 
GPCLKAENrTSGFLGPLI-VI.QAGF> 



TTTCTTGTTGACAAGAATCCTCACM.TACCGCAGAGTCTAGACTCGTGGTGGACTTCTCTCAATTTTCTA 
AAAGAACAACTGTTCrTAGGAGTGTTATG<3CGTCTaVGATCrGAGav.CCACCrGAAGAGAGTTAAAAGAT 
FLLTRILTIPQSLDSWWTSLNFL> 



GGGGGAACTACCGTGTGTCTTGGCCAAAATTCGCAGTCCCCAACCTCCAATCACTCACCAACCTC^ 
CCCCCTTGATGGCACACAGAACCGGTTTTAAGCGTCAGGGGTTGGAGGTTAGTGAGrGGTTGGAGAfl.CAG 
GGTTVCL.GQKSQSPTSNKS PrsC> 



CrCCAACTTGTCCTGGXTATCGCTGGATGTGTCTGCGGCGTTTTATCATCTTCCTCTTCATCCTGCTGCT 
GAGGTTGAACAGGACCAATAGCGACCTACACAGACGCCGCAAAATAGTAGAAGGAGAAGTAGGACGACGA 
PPTCPGYRWMCLRRFI IFLFILLLs. 



ATGCCTG 

TACGGAGTAGAAGAACAACCAAGAAGACCTGATAGTTCCATACAACGGGCAAACAGGAGATTAAGGTCCT 
CLIFLLVLLDYQ<3MLPVCPI.IPG> 

430 440 450 460 470 480 490 

TCCTCAACAACCAGCACGGGACCATGCCGGACCTGCATGACTACTGCTCAAGGAACCTCTATGTATCCCT 

SSTTSTGPCRTCMTTAQG TSMYP> 

500 510 S20 530 540 SSO SSQ 



GGACAACGACATGGTTTGGAAGCCTGCCTTrAACGTGGACATAAGGGTAGGGTAGTAGCACCCGAAAGCC 
SCCCTKPSDGNCTCIPIP SSWAFG> 



a.TTTGTTCAGTGG 

ttttaaggataccctcacccggagtcgggcaaagaggaccgagtcaaatgatca.cggtaaacaa^^ 
kfi.wewa'sarfshlsllvpfvqw> 
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640 650 660 670 630 590 700 

TTCGTAGGGCTTTCCCCCACTGTTTGGCTTTCAGTTATATGGATGATGTGGTATTGGGGGCCaAGTCTGT 
AAGO^TCCCGAAAGGGGGTGACAAACCGAAAGTCAATATACCTACTACACCATAACCCCCGGTTCAGACA 
FVGLS PTVWLSVIWMMHYWGPSL> 



710 720 730 740 750 760 770 

ACAGCATCTTGAGTCCCrTTTTACCGCTGTTACCAATTTTCTrTTGTCTTTGGG^^ 

TGTCGTAGAACTCAGGGAAAAATGGCGACAATGGrrAAAA.GAAAA.CAGAAACCCATATGTAAATTTGGGA 

YSII.SPFLPLLPrFFCIiWVYI*> 



780 790 800 

AACAAAACAAAGAGATGGGGTTACTCTCTAA 
TTGTTTrGTTTCTCTACCCCAATGAGAGATT 



FIGURE 8 COtrriNUED 
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10 20 30 40 50 60 70 



CSCTAGCGCCGCCACCATGCCAGGCXXrrCTAGAAGCCCrCAGACCCC^ 
CGATCGCGGCa3TCK2TACCGTCCCCCAGAlxrrrCGCX:A<nXrr^ 

HPGGLEAI.RAL S 'L U L F L. S> 

ao 90 100 no 120 130 140 



ACCCCTGTTIXXXrrCCCCGATGCCAGGCCATCAGCCAGGCXGTGCACGCCGCTC 
TGCCX^CAAACCCAGGGCCTAO^GTCCCKTAGTCCXrr^ 

yACLGPGCQAIS QAVHAAHAEIHE> 

150 160 170 ISO ISO 200 210 

AGCTGGAAGAACCCCTCCACCrTATCGGCCTCCAAAa3C^ 

TCca.ccrrcrixxxx;AGGTCcy^ATAGCGGGACcr^^ 

AGRTPPAYRP PNAP Il.FFLLTRr> 

220 230 240 250 200 270 280 

CTCyuiAATCCCCCAGTCCCTCCACGCCAAGTTCGTOS^^ 
GACTCTTAGGCKSTrCAGGGACCTGCGGTTCAAGCACCaAOSC^ 
L TrpQSt.DAKFVAAWTLKAAAGI> 

290 300 310 320 330 340 3S0 

TCTTGCTGTTCTGTGCAGTGGTGCCAGGGACGCTGCT^^ 

AGAACGACAAGACACCrrCACCACGGTCCCTGCCACGACGATAAGTCCTrrTXKrCACCGTT^ 
2LLFCAV'VPGTLL.LFRKRWQNEKF> 

360 370 330 390 400 410 420 

TQSGGTCGACATCCCAGATGACTATGAAGATGAAAATCTCTATGAGGG^ 
ACCCC^CCTCrrACCGTCTACTGATACTTCTACTTTTAGAGATACTCC^ 

G V D M P D D Y E D E N L Y E G L. M L D D.C S> 

430 440 4S0 4S0 470 430 490 

ATCTATGAGGACATCTCCAGGGGACTCCAGGGCACCTACCAGGAtGTt^^ 
TACATACTCCriXn'AGAGGTCICCCTGAGGTCCCGTXXJATGGTCCTACACCCGTT^ 
MY E D I S RG L Q GTY Q D V G N L H I G D> 

SCO SIO 



CCCAGCTGGAAAAGCCATGAGGTACC 
GGGTCGACCTTTTCGGTACTCCATGG 
A <a L E K P *> 



FIGURE 9 
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10 20 30 40 ■ SO 60 70 

GCTAGCGCCGCCACCATCXX:CACACI02TGCrCrTCT^ 
CGATCGCCKKIGGTGXSTACCCKTrOTGACCACGACAGAAGGTACGGGACGGTC 

MATLVLSSM PC K ^ L h T U L> 

80 90 100 110 120 130 140 



TCKrrCTTCTCAGGTGAGCCGATCAGCCAGGCTGTGCACGCCGCT^ 
ACGAGAAGAGTCCACTCGGCTAGTOSGTCCGACACGTGCQSCGAGTGCGGCT^ 

LLFSGEP1SQAVHAAHAEINEAGR> 

150 ISO 170 130 190 200 210 

AACCCCICCAGCTTATCGCCCrCCAAACGCTCCTATCCTGTXX^^ 
TltXXXSAGGTCGAATAGCGGGAGGTTTCCGAGGATAGCACAAGAAAGACaAC^^ 

TPPAYRPPNAPILFF1.LTRI1.TI> 

220 230 240 2S0 250 270 280 

CCCCAGTCCCTSGACGCCAAGTTCXSTGGCTGCCIGGACCC^ 

GGGGTCAGGGACCTGCGGTTCAAGCACCGACGGACCTGGGACrrCCGACGGCGATAATAGAACTAGGT^ 
P QSLDAKFVAAWTLKAAAI I^L.^ I Q> 

290 300 310 320 330 340 350 



CCCTCCTCATCATCCTCTTCATCATTGTGCCCATCrTCCTGCTAC:^ 

GGGAGGAGTAGTAGGAGAAGTAGTAACACGGGTAGAAGGACGATGAACTGrrcCTACTGCCGTTCCGACC 
' Tr- L L ■■ r" X. L f"' X I V p I.r F L L L D K D D G K A G> 

3S0 370 380 390 400 410 420 

GATGGAGGAAGATCACACCTATGAGGGCTtGAACATTGACCAGACAGCCACCTATGAAGACATAGl^ 
• CTACCrcCTIxrrAGTGTGGATACTCCCGAACTlGTAACTGGTCXGX^^ 

MEEDHTYEGLNIDQTATYEDIVT> 

430 440 450 4G0 470 480 

CTTCGGACAGGGGAGGTAAAGTGGTCGGTAGGAGAGCATCCAGGCCAGGAATGAC^ 
GAAGCCTGTCCCCTCCATTTCACCAGCCATCCTCTCGTAGGTCCGGTCCXTACTCC^ 
LRTGEVKWSVGEHPGQE*> 



FIGURE 10 
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CGAT03CG^.CGGTCKTACCCTTACGTCCACGTCTAGGTCTCGGACAAAGACGA<^^ 

HGMQVQIQ SLFLLLLWVP> 



CXrrCCCGAGGAATCAQrCAGGCTGTGCACGCCCXriXACCxrCGAAATC^ 

ccArxs3CTCcrrAOTccJ3TcccACAccnaccxx:GA£^ 

GS RGISQAVH.AAHAE INEAGRTPP> 



AGCnrTATCCK:ccrCOL?ACGCTCCTAT^ 

TCGAATAGCGGGAGGTTTCKCAGGATACXSACAAGAAAGACGACTC^ 

AYRPPNAPILFFLLTRILTIPQS> 



CTCX3ACGCCAAGTTCCrrtS3CTCXCTCX;ACCCTGAAGCSr^ 
GACCrrGCGGrrCAAC^ACCGACGGACCTGGGACTTCCGACCK;CGAACTCC^ 
LDAKFV AAWTLK AAA »> 



FIGURE 11 
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TTCCCAG ATG CAC AGG AGG AGA AGC AGG AGC TGT CGG GAA GAT CAG AAG 
Met His Arg Arg Arg Ser Arg Ser Cys Arg Glu Asp Gin Lys 



CCA GTC ATG GAT GAC CAG CGC GAC CTT ATC TCC AAC AAT GAG CAA CTG 
Pro Val Met Asp Asp Gin Axg Asp Leu He Ser Asn Asn Glu Gin Leu 



CCC ATG CTG GGC CGG CGC CCT GGG GCC CCG GAG AGC AAG TGC AGC CGC 
Pro Met Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg 



GGA GCC CTG TAC ACA GGC TTT TCC ATC CTG GTG. ACT CTG CTC CTC GCT 
Gly Ala Leu Tyr Thr Gly Phe Ser He Leu Val Thr Leu Leu Leu Ala 



GGC CAG GCC ACC ACC GCC TAC TTC CTG TAC CAG CAG CAG GGC CGG CTG 
Gly Gin. Ala Thr Thr Ala Tyr Phe Leu Tyr Gin Gin Gin Gly Arg Leu 



GAC AAA. CTG ACA GTC ACC TCC CAG AAC CTG CAG CTG GAG AAC CTG CGC 
Asp Lys Leu Thr Val Thr Ser Gin Asn Leu Gin Leu Glu Asn Leu Arg 



ATG AAG CTT CCC AAG CCT CCC AAG CCT GTG AGC AAG ATG CGC ATG GCC 
Met Lys 'Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Axg Mat Ala 
95 100 105 110 

ACC CCG CTG CTG ATG CAG GCG CTG CCC ATG GGA GCC CTG CCC CAG GGG 
Thr Pro Leu Leu Met Gin Ala Leu Pro Met Gly Ala Leu Pro Gin Gly 
115 120 12S 

CCC ATG CAG AAT GCC ACC AAG TAT GGC AAC ATG ACA GAG GAC CAT GTG 
Pro Met Gin Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val 
,130 135 140 

ATG CAC CTG CTC CAG AAT GCT GAC CCC CTG AAG GTG TAC CCG CCA CTG 
Met His Leu Leu Gin Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu 
145 150 155. 

AAG GGG AGC TTC CCG GAG AAC CTG AGA CAC CTT AAG AAC ACC ATG GAG 
Lys Gly Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu 
ISO 155 X70 

ACC ATA GAC TGG AAG GTC TTT GAG AGC TGG ATG CAC CAT TGG CTC CTG 
Thr He Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu 
175 130 18S 190 



FIGaRE 12 
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TTT GAA ATG AGC AGG CAC TCC TTG GAG CAA AAG CCC ACT GAC GCT CCA S25 
Phe Glu Met Ser Arg Kis Ser Leu Glu Gin Lys Pro Ttir Asp Ala Pro 
135 200 205 

CCG AAA GAG TCA CTG GAA CTG GAG GAC CCG TCT TCT GGG CTG GGT GTG S73 
Pro Lys Glu Ser Leu Glu Leu Glu Asp Pro Ser Ser Gly Leu Gly Val 
210 21S 220 

ACC AAG CAG GAT CTG C-GC CCA GTC CCC ATG TGAGAGCAGC AGAGGCGGTC 723 
Thr Lys Gin Asp Leu Gly Pro Val Pro Met 
225 230 
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CCGCCTCGGC ATG GCG CCC CGC AGC GCC CGG CGA CCC CTG CTG CTG CTA 
Met Ala Pro Arg Ser Ala Axg Arg Pro Lau Lau Leu Leu 



CTG COT GTT GCT GCT GCt CGG CCT CAT GCA TTG TCG TCA <3CA GCC ATG 
Leu Pro val Ala Ala Ala Arg Pro His Ala Leu Ser Ser Ala Ala Met 



TTT ATG GTG AAA AAT GGC AAC GGG ACC GCG TGC ATA ATG GCC AAC TTC 
Phe Met Val Lys Asn Gly Asn Gly Thr Ala Cys lie Met Ala Asn Phe 



TCT GCT GCC TTC TCA GTG AAC TAG GAC ACC AAG AGT GGC CCC AAG AAC 
Ser Ala Ala Phe Ser Val Asn Tyr Asp Thr Lys Ser Gly Pro Lys Asn 



ATG ACC TTT GAC CTG CCA. TCA GAT GCC ACA GTG GTG CTC AAC CGC AGC 
Met Thr Phe Asp Leu Pro Ser Asp Ala Thr Val Val Leu Asn Arg Ser 



TCC TGT GGA AAA. GAG AAC ACT TCT GAC CCC AGT CTC GTG ATT GCT TTT 
Ser Cys Gly Lys Glu Asn Thr Ser Asp Pro Ser Leu Val lie Ala Phe 



GGA AGA GGA CAT ACA CTG AGT CTC AAT TTC ACG AGA AAT GCA ACA CGT 
Gly Arg Gly His Thr Leu Thr Leu A«n Phe Thr Arg Asn Ala Thr Arg 
35 100 105 

TAG AGC GTT CAG CTC ATG AGT TTT GTT TAT AAC TTG TCA GAC ACA CAC 
Tyr Ser Val Gin Leu Mat Ser Phe Val Tyr Asn Leu Ser Asp Thr His 
110 lis 120 12S 

CTT TTG CCC AAT GCG AGC TCC AAA GAA ATC AAG AGT GTG GAA TCT ATA 
Leu Phe Pro Asn Ala Ser Ser Lys Glu lie Lys Thr Val Glu Ser lie 
130 135 140 

ACT GAC ATC AGG GCA GAT ATA GAT AAA AAA TAG AGA TGT GTT AGT GGC 
Thr Asp He Arg Ala Asp He Asp Lys Lys Tyr Arg Cys Val Ser Gly 
145 ISO 1S5 

ACC CAG GTC CAC ATG AAC AAC GTG ACC GTA ACG CTC CAT GAT GCC ACC 
Thr Gin Val His Met Asn Asn Val Thr Val Thr Leu His Asp Ala Thr 
ISO les 170 

ATC CAG GCG TAC CTT TCC AAC AGC AGC TTC AGC AGG GGA GAG ACA CGC 
lie Gin Ala Tyr Leu Ser Asn Ser Ser Phe Ser Arg Gly Glu Thr Arg 
175 130 185 
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TGT GAA CAA GAC AGG CCT TCC CCA ACC ACA GCG CCC OCT GCG CCA CCC 
Cya Glu Gin Asp Axg Pro Ser Pro Thr Thr Ala Pro Pro Ala Pro Pro 
190 ISS 200 20S 

AGC CCC TCG CCC TCA CCC GTG CCC AAG AGC CCC TCT GTG GAC AAG TAC 
Ser Pro Ser Bxo Ser Pro Val Pro Lys Ser Pro Ser Val Asp Lys Tyr 

210 21S 220 

AAC GTG AGC GGC ACC AS.C GGG ACC TGC CTG CTG GCC AGC ATG GGG CTG 
Asn Val Ser Gly Thr Asn Gly Thr Cys Leu Lea Ala Ser Met Gly Leu 
225 230 235 

CAG CTG AAC CTC ACC TAT GAG AGG AAG GAC AAC ACG ACS GTG ACA AGG 
Gin. Lau Asn Leu Thr Tyr Glu Arg Lys Asp Asn Thr Thr Val- Thr Arg 
240 24S ' 2S0 

CTT CTC AAC ATC AAC CCC AAC AAG ACC TCG GCC AGC GGG AGC TGC GGC 
Leu Leu Asn lie Asn Pro Asn Lys Thr Ser Ala Ser Gly Sar Cys Gly 
2SS 250 2SS 

GCC CAC CTG GTG ACT CTG GAG CTG CAC AGC GAG GGC ACC ACC GTC CTG 
Ala iiis Leu Val Thr Leu Glu Leu His Ser Glu Gly Thr Thr Val Leu 
270 275 280 285 

CTC TTC CAG TTC GGG ATG AAT GCA AGT TCT AGC CGG TTT TTC CTA CAA 
Leu Pha Gin Phe Gly MeC Asn Ala Ser Ser Ser Arg Phe Phe Leu Gin 
290 295 300 

GGA ATC CAG TTG AAT ACA ATT CTT CCT GAC GCC AGA GAC CCT GCC TTT 
Gly lie Gin .Leu Asn Thr lie Leu Pro Asp Ala Arg Asp Pro Ala Phe 
305 310 315 

AAR. GCT GCC AAC GGC TCC CTG CGA GCG CTG CAG GCC ACA GTC GGC AAT 
Lys Ala Ala Asn Gly Ser Leu Arg Ala Leu Gin Ala Thr Val Gly Asn 
320 325 330 

TCC TAC AAG TGC AAC GCG GAG GAG CAC GTC CGT GTC ACG AAG GCG TTT 
ser Tyr Lys Cys Asn Ala Glu Glu His Val Arg Val Thr Lys Ala Phe 



TCA GTC AAT ATA TTC AAA GTG TGG GTC CAG GCT TTC AAG GTG GAA GGT 
Ser Val Asn lie Phe Lys Val Trp Val Gin Ala Phe Lys Val Glu Gly 
350 355 360 3SS 

GGC CAG TTT GGC TCT GTG GAG GAG TGT CTG CTG GAC GAG AAC AGC ACG 
Gly Gin Phe Gly Ser Val Glu Glu Cys Leu Leu Asp Glu Asn Ser Thr 
370 375 330 
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CTG ATC CCC ATC GCT GTG GGT GGT GCC CTG GCG GGG CTG GTC CTC ATC 
Leu He Pro He Ala Val Gly Gly Ala Leu Ala Gly Leu Val Leu lie 



GTC CTC ATC GCC TAG CTC GTC GGC AGG ARG AGG AGT CAC GCA GGC TAC 
Val Leu lie Ala Tyr Leu Val Gly Arg Lys Arg Ser His Ala Gly Tyr 
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10 20 30 40 50 60 70 

ATGATO^CaiTCCTGCCGCTGCTGCTGGGGCTCaGCCTGGGCTGCAaAGGAGCRGGTGGCTTCGTGGCCC 
TACTAGTGTAAGGACGGCGACGACGACCCCGAGTCGGACCCGACGTGTCCTCGTCCACCGAAGCACCGGG 
MITFLPI,LI.GLSLGCTGAGGFVA> 



ao 90 100 110 120 130 140 

ATGTGGAAAGCACCTGTCTGTTGGATGATGCTGGGACTCCAAAGGATTTCACATACTGCATCTCCrTCAA 
TA(:a.CCrrTCGTGGACAGACAACCTACTACGACCCTGAGGTTTCCTAAAGTGTATGACGTAGAGGAAGTT 
HVESTCLI.DDAGTPKDFTYCISFN> 



ISO ISO 170 180 190 200 210 

C^AGGATCTGCTGACCTGCrGGGATCCAGAGGAGAATAAGATGGCCCCTTGCGAATTTGGGGTGCTGAAT 
GTTCCTAGACGACTGGACGACCCrAGGTCTCCTCTTATTCTACCGGGGAACGCTTAAACCCCACGACrrA 
KDLI.TCWD PEEN-KMAPCEFGVLN> 



220 233 240 250 260 270 230 

AGCTTGGCGAATGTCCTCTCAO^GCACCTCAACCAAAAAGACACCCTGATGCAGCGCTTGCGCAATGGGC 
TCGAACCGCTTACAGGAGAGTGTCGTGGAGTTGGTTTTTCTGTGGGACTACGTCGCGAACGCGTTACCCG 
SLANVL3QHLNQ'KDTLMQRL RNG> 



290 300 310 320 330 340 350 

ttcagaattgtgccacacacyjlcccagcccttctggggatcactgaccaacaggacacggccaccatctgt 
aagtcttaacacggtgtgtgtgggtcgggaagacccctagtgactggttgtcctgtgccggtggtagaca 
lqncathtqpfwgsltnrtri>psv> 



360 370 380 390 400 410 420 

GCAAGTAGCCAAAACCACTCCTTTTAACACGAGGGAGCCTGTGATGCTGGCCTGCTATGTGTGGGGCTTC 
CGTrCATCGGTTTTGGTGAGGABAATTGTGCTCCCTCGGACACTACGACCGGACGATACACACCCCGAAG 
QVA KTTPFNTREPVMLACYVWGF> 



430 440 450 460 470 480 490 

TATCCAGCAGAAGTGACTATCACGTGGAGGAAGAACGGGAAGCTTGTCATGCCTCACAGCAGTGCGCACA 
ATAGGTCGTCTTCACTGATAGTGCACCTCCTTCTTGCCCTTCGAACAGTACGGAGTGTCGTCACGCGTGT 
YPAEVTITWRKKGKLVMPHS SAH> 



SOO 510 520 530 540 550 560 

AGAOTGCCCAGCCCAATGGAGACTGGACATACCAGACCCTCTCCCATTTAGCCrTAACCCCCTCTTACGG 
TCTGACGGGTCGGGTTACCTCTGACCTGTATGGTCTGGGAGAGGGTAAATCGGAATTGGGGGAGAATGCC 
KTAQPNGDWTYQTLSHLALT PSYG> 
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GC3ACa.CTTACACCrGTGTGGTAGa.GCaa^Tr03GGCTCCTGAGCCCATCCTTCGGGAC^ 
CCTGTGAATGTGGACACACCATCTCGTGTAA.CCCCGAGGACTCGGGTAGGAAGCCCTGACCTGTGGACCC 
DTYTCVVSHIGAPEPILRDWTPO 



GACAGGGGGTACGTCTGGGACTrCCAAAGACACAGACGTCACTGAGACCCGGACCCGGAGTAGTAGAAGA 
LSPMQTl4KVSVSAVTLGLGLIIF> 



710 720 730 740 7S0 760 770 

CrCTTGGTGTGATCAGCTCK3CGGAGAGCTGGCCaCTCTA3TrACACTCCTCTTCCTGGGTCCAATTATTC 
GAGAACCACACTAGTCGACCGCCTCTCGACCGGTGAGATCAATGTGAGGAGAAGGACGCAGGTTAATAAG 
SL GVISWR RAGHSSYTPI.PGSNyS> 



780 730 

AGAAGGATGGCACAnTCCTAG 
TCTTCCTACCGTGTAAAGGATC 
E G W H 1 S *> 
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ATGGGTTCTGGGTGGGTCCCCTGGGTGGTGGCTCTGCTAGTGAATCTGACCCSUVCTGGATTCCTCC^^ 
TACCCAAGACCCACCCAGGGGACCCACCACCGAGACGATCACTTAGACTGGGTTGACCTAAGGAGGTACT 
MGSGWVPWvVALLVNIiTQLDS SM> 



CTClUlGGCACAGACTCrCCAGAAGATTTTGTGATTCa.GGCAAAGGCTGACTGTTACTrCACC^ 
GAGTTCCGTGTCTGAGAGGTCrrTCTAAAACACTAAGTCCGTTrCCGACTGACAATGAAGTGGTTGCCCTG • 
rQGTDSPEDFVIQAKADCYFTNGT> 



AGAAAAGGTGCAGTTTGTGGTCAGATTCATCrrTAACTTGGAGGAGTATGTACGTTTCGACAGTGATGTG 
TCTTrTCCACGTCAAACACCAGTCTAAGTAGAAiTTGAACCrCCTCATACA,TGCAAAGCTGTCACTACAC 
EKVQFVV RFI FNL.E E yV RFDSDV> 



GGGATGriT:GTGGC7.TTGACCAAGCTGGGGCAGCCAGATGCTGAGCAGTGGAACa.GCCGGCTGGATCTCT 
CCCTACAAS.CACCGTAA.CTGGTTCGACCCCGTCGGTCTACGACXCGTCACCTTGTCGGCCGACCTAGAGA 
GMFVALTKLGQPDAEQWNSRLDL> 



TGGAGAGGAGCAGACAGGCCGTGGATGGGGTCTGTAGACACAACTACAGGCTGGGCGCACCCTTCACTGT 
ACCTCTCCTCGTGTGTCCGGCACCTACCCCAGACATCTGTGTTGATGTCCGACCCGCGTGGGAAGTGACA 
LERSRQAVDGVCRHNyE.riGAP?TV> 



GGGGAGAAAAGTGCAACCAGAGGTGACAGTGTACCCAGAGAGGACCCCACTCCTGCACCAGCATAATCTG 
CCCCTCTrrTCACGTTGGTCTCCACTGTCACATGGGTCTCTCCTGGGGTGAGGACGTGGTCGTATTAGAC 
GRXVQPEVTVYPERTPLLHQHNL> 



CTGCACTGCTCTGTGACAGGCTTCTATCCAGGGGATATCAAGATCAAGTGGTTCCTGAATGGGCAGGAGG 
GACGTGACGAGACACTGTCCGAAGATAGGTCCCCTATAGTTCTAGTTCACCAAGGACTTACCCGTCCTCC 
LHCSVTGFYPGDIKI KWFLNGQEs 
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AGAGAGCTGGGGTCATGTCCACTGGCCCTATCAGG2\ATGGAGACTGGACCTrTCA,(3ACTGTGGTGAXGCT 
TCTCTCGACCCCAGTACAGGTGACCGGGATAGTCCTTACCTCTGACCTGGAAAGTCTGACACCACTACGA 
E'RAGVMSTGPrRNGDWTFQTVVML> 



AGAAATGACTCCTGAACrTGGACATGTCrACACCTGCCTTGTCGATCACTCCAGCCTGCrGAGCCCTGTT 
TCTTTACTGAGGACTTGAAGCTGTACAGATGTGGACGGAACAGCTAGTGAGGTCGGACGACTCGGGACAA 
EMTPEL.GHVYTCLVDHSSLLSPV> 



TCTGTGGAGTGGAGAGCTCAGTCrGAATATTCTTGGAGAAAGATGCrGAGTGGCATTGCAGCCrrCCTAC 
AGACACCTCACCTCTCGAGTCAGACTTATAAGAACCTCTTTCTACGACTCACCGTAACGTCGGAAGGATG 
SVEMRAQSEYSWRKMLSGIAAFL> 



TTGGGCTAATCTTCCTTCTGGTGGGAATCGTCATCCAGCTAAGGGCTCAGAAAGGATATGTGAGGACGCa 
AACCCGATTAGAAGGAAGACCaCCCTTAGCAGTAGGTCGATTCCCGAGTCTTTCCTATACACTCCTGCGT 
LGLIFL LVGrVIQLRAQKGYVRTQ> 



GATGTCTGGTAATGAGG7CTCAAGAGCTGTTCTGCTCCCTCAGTCATGCTAA 
CTACAGACCATTACTCCAGAGtTCTCGACAAGACGAGGGAGTCAGTACGATT 
M SGNEVSRAVLLPQSC*> 
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ATGCCTGGGGGTCCAGGAGTCCTCCAAGCTCTGCCTGCCACCATCTTCCTCCTCTTCCTGCTGTCTGCTG 
TACGGACCCCCAGGTCCTCAGGAGGTTCGAGACGGACGGTGGTAGAAGGAGGAOAAGGACGACAGACGAC 
MPGGPGVLQALPATIPLLFL,]:.SA> 



TCTACCTGGGCCCTGGGTGCCAGGCCCTGTGGATGCACAAGGTCCCAGCATCATTGATGGTGAGCCTGGG 
AGATGGACCCGGGACCCACGGTCCGGGACACCTACGTGTTCCAGGGTCGTAGTAACTACCACTCGGACCC 
VYLGPGCQALWMHKVPAS LMVSLG> 



GGAAGACGCCCACTTCCAATGCCCGCACAATAGCAGCAACAACGCCAACGTCACCTGGTGGCGCGTCCTC 
CCTTCTGCGGGTGAAGGTTACGGGCGTGTTATCGXCGTTGTTGCGGTTGCAGTGGACCACCGCGCAGGAG 
EDAHFCCPHNSSNNANVTWWRVL> 



CATGGCAACTACACGTGGCCCCCTGAGTTCTTGGGCCCGGGCGAGGACCCCAATGGTACGCTGATCATCC 
GTACCGTTGATGTGCACC3GGGGACTCAAGAACCCGGGCCCGCTCCTGGGGTTACCATGCGACTAGTAGG 
KGNYTWPPEFLGPGEDPNGTLII> 



agaatgtgaacaagagcca7gggggcatatacgtgtgccgggtccaggagggcaacgagtcataccagca 
tcttacacttgttctcggtacccccgtatatgcacacggcccaggtcctcccgttgctcagtatggtcgt 
qnvnks:-:ggiyvcrvqegnesyqq> 



GTCCTGCGGCACCTACCrCCGCGTGCGCCAGCCGCCCCCCAGGCCCTTCCTGGACATGGGGGAGGGCACC 
CAGGACGCCGTGGATGGAGGCGCACGCGGTCGGCGGGGGGTCCGGGAAGGACCTGTACCCCCTCCGGTGG 
SCGTYLRVRQPPPRPFI-DMGEGT^ 



AAGAACC6AATCATCACAGCCGAGGGGATCATCCTCCTGTTCTGCGCGGTGGTGCCTGGGACGCTGCTGC 
TTCTTGGCTTAGTAGTGTCGGCTCCCCTAGTAGGAGGAGAAGACGCGCCACCACGGACCCTGCGACGACG 
KNRIITAEGrrLLFCAVVPGTLL> 
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500 SlO 520 530 540 SSO 560 

TGTTCaGGAAACGATGGCAGAA,CGAGAAGCTCGGGTTGGATGCCGGGGATGAaTATGAAGATGAAAACCT 

ACAAGTCCTTTGCTACCGTCTTGCTCTTCGAGCCCAACCTACGGCCCCrACTTATACTTCTACTT^ 

LFRKRWQNSKLGI,DAGr)EYEDENL> 

570 530 590 600 610 620 630 



TTATGAAGGCCTGAACCTGGACGACTGCTCCATGTATGAGGACATCTCCCGGGGCCTCCAGGGCACCTAC 
AATACTTCCGGACrTGGACCTGCTGACGAGGTACATACrCCTGTAGAGGGCCCCGGAGGTCCCGTGGATG 
YEGLNLDDCSMYED I S RGL,QGTy> 

640 650 660 570 680 690 700 

CaGGATGTGGGCAGCCTCAACATAGQAGATGTCCAGCTGGAGAAGCCGTGACACCCCTACTCCTGCCAGG 
GTCCTACACCCGTCGGAGTTGTATCCTCTACAGGTCGACCTCTTCGGCACTGTGGGGATGAGGACGGTCC 
QDVGSLNIGDVQLEKP*> 
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GAATTCCGCG GTGACC ATG GCC AGG CTG GCG TTG TCT CCT GTG CCC AGC 
MeC Ala Arg Leu Ala Leu Ser Pro Val Pro Ser 



CAC TGG ATG GTG GCG TTG CTG CTG CTG CTC TCA GCT GAG CCA GTA CCA 
His Trp Met Val Ala Leu Leu Leu .Leu Leu Ser Ala Glu Pro Val Pro 



GCA GCC AGA TCG GAG GAC CGG TAG CGG AAT CCC AAA GGT AGT GCT TGT 
Ala Ala Arg Ser Glu Asp Arg Tyr Arg Asn Pro Lys Gly Ser Ala Cys 



TCG CGG ATC TGG CAG AGC CCA CGT TTC ATA GCC AGG AAA CGG CGC TTC 
Ser Arg lie Trp Gin Ser Pro Arg Phe lie Ala Arg Lys Arg Arg Phe 



ACG GTG AAA ATG CAC TGC TAC ATG AAC AGC GCC TCC GGC AAT GTG AGC 
Thr Val Lys Mec His Cys Tyr Met Asn Ser Ala Ser Gly Asn Val Ser 



TGG CTC TGG AAG CAG GAG ATG GAC GAG AAT CCC CAG CAG CTG AAG CTG 
Trp Leu Trp Lys Gin Glu Met Asp Glu Asn Pro Gin Gin Leu Lys Leu 



GAA AAG GGC CGC ATG GAA GAG TCC CAG AAC GAA TCT CTC GCC ACC CTC 
Glu Lys ■ Gly Arg Met Glu Glu Ser Gin Asn Glu Ser Leu Ala Thr Leu 
95 XOO lOS 

ACC ATC CAA GGC ATC CGG TTT GAG GAC AAT GGC ATC TAC TTC TGC CAG 
Thr lie Gin. Gly lis Arg Phe Glu Asp Asn Gly lie Tyr Phe Cys Gin 
110 lis 120 

CAG AAG TGC AAC AAC ACC TCG GAG GTC TAC CAG GGC TGC GGC ACA GAG 
Gin Lys Cys Asn Asn Thr Ser Glu Val Tyr Gin Gly Cys Gly Thr Glu 
125 130 13S 

CTG CGA GTC ATG GGA TTC AGC ACC TTG GCA CAG CTG AAG CAG AGG AAC 
Leu Arg Val Met Gly Phe Ser Thr Leu Ala Gin Leu Lys Gin Arg Asn 
140 145 ISO 155 

ACG CTG AAG GAT GGT ATC ATC ATG ATC CAG ACQ CTG CTG ATC ATC CTC 
Thr Leu Lys Asp Gly lie lie Met lie Gin Thr Leu Leu lie lie Leu 
160 155 170 
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TTC ATC ATC GTG CCT ATC TTC CTG CTG CTG GAC AAG GAT GAG AGC AAG S77 
Phe lie lie Val Pro lie Phe Leu Leu Leu Asp Lys Asp Asp Ser Lys 
17S 180 IBS 

GCT GGC ATG GAG (3AA GAT CAC ACC TAG GAG GGC CTG GAC ATT GAC CAG 625 
Ala Gly Met Glu Glu Asp His Thr Tyr Glu Gly Lau Asp Xle Asp Gin 
190 19S 200 

ACA GCC ACC TAT GAG GAC ATA GTG ACG CTG CGG ACA GGG GAA GTG AAG 673 
Thr Ala Thr Tyr Glu Asp He Val Thr Leu Arg Thr Gly Glu Val Lys 
205 210 215 

TGG TOT GTA GGT GAG CAC CCA GGC CAG GAG TGAGAGCCAG GTCGCCCCAT 723 
Trp Ser Val Gly Glu His Pro Gly Gin Glu 
220 22S 230 
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I Signal seq.|T hefper epitope st ring Transmembrane | O/t opiasmic 
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10 2C 30 40 SO 60 70 

GACGGATCGGGAGATCrCCCGA.TCCCCrATGGTCGACTCrCAGTAaUVTCrGCT 

CTGCCTAGCCCTCTAGAGGGCTAGGGGATACCAGCTOAGAGTCATGTTAGACGAGACTACGGCGTATa^ 

80 90 100 110 120 130 140 

AAGCa\.GTATCTGCTCCC:rGCTTQTGTGTTGGAGGTCGCrC5AGTAGTGCGCGAGCAAAATTTAAGCTA 
TTCGGTCATAGACGAGGGACGAACACACAACCTCCAGCGACrCATCACGCGCTCGTTrrAAATTCG^ 

150 ISO 170 180 190 200 210 

ACAAGGay^.GGCrrGACCGACAATTGaiTGAAGAATCTGCTTAGGGTTAC^ 

TGTrCCGrrCCGAACTGGCTGTTAACGTACTTCTTAGACGAATCCCAATCCGCAAAACGCGACGAAGCGC 

220 230 240 2S0 2S0 270 230 

ATGTACGGGCCAGATATACGCGTTGACATTGATrATTGACTAGTTATTAATAGTAATCAATTACG<K^ 
TACATGCCCGGTCTATATGCGCAACrrGTARCrAATAACTGATCAATAArTATCATTAGTTAATGCCCCAG 

290 300 310 320 330 340 3S0 

ArrAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCG 
TAATCAAGTATCGGGTATATACCTCAAGGCGCaATGTATTGAATGCCATTTACCGGGCGGACCGACTGGC 

360 370 380 390 400 410 420 

CCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATG'rrCCCATAGTAACGCCAATAGGGACTITCC 
GGGTTGCXGGGGGCGGGTAACTGCAGTTATTACTGCATACAAGCWTATCaTTGCGGTTATCCCTGAAAGG 

430 440 4S0 460 470 480 490 

ATTGACGTCAATGGGTGGACTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCC 
TAACTGCAGTTACCCACCTGATAAATGCCATTrGACGGGTGAACCGTCATGTAGTTCACATAGTATACGG 

SOO SIO 520 530 540 SSQ 560 

AAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCAGTACATGACCTTA 
TTCATGCGGGGGATAACTGCAGTTACTGCCATTTACCGGGCGGACCGTAATACGGGTCATGTACTGGAAT 

570 530 590 600 610 620 630 

TGGGACnrnrCCTACrTGGCa.GTAa\.TCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTTTl^ 
ACCCTGAAAGGATGAACCGTCATGTAGATGCATAATCAGTAGCGATAATGGTACCACTACGCCAAAACCG 

640 650 660 670 680 690 700 

AGTACATCAATGGGCGTGQATAGCGGTTTGACTCaiCGGGGATTTCaUiGTCTCCACCCCATTGACGTC^ 
TCATGTAGTrACCCGCACCTATCGCCflAACrGAGTGCCCCTAAAGaTTCAGAGGTGGGGTAACrGCAGTT 



29/42 



CAAATGGGCCSGTAGGCGTGTACGGTGGGAGGTCTATATAAGCaGAGCTCTCTCSqCTAACTAGAGAACCCA 
GTrrACCCGCCATCCGCACATGCCACCCTCCAGATATATrCGTCTCGAGAGACCGATTGATCTCTrCSGGT 



CCrrATAGAGTCTATAGGCCCACCCCCTTGGCTTCTTATGCATGCTATACTGTTTTTGGCTTGGGG 
GGATATCTCAGATATCCGGGTGGGGGAACCGAAGAATACGTACGATATGACAAAAACCGAACCCCAGATA 

990 1000 1010 1020 1030 1040 lOSO 
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1410 1420 1430 1440 X4S0 1460 1470 

AGTGGAGaCCaX3ACrTAGGCACaGCACGATGCCCACCaCCa.CCAGTGTGCCGCACAAGGCCGTtMCGCT 



GGGTATGTGTCTGAARATGAGCTCGGGGAGCGGGOTGCACCGCTGACGCATTTGCaU^GACTTJ^ 
CCCa.TAa^CAGACTTTTACTCGAGCCCCTCGCCCGAACGTGGCGACTGCGTAAACCTTCTGAATTCCGTC 



1830 1840 18S0 1860 1870 1880 1890 

AGCTGGTrCTTTCCGCCTCAGAAGCCA.TAGAGCCCACCGCATCCCCAGCATGCCTGCTATTGTCTTCCCA 
TCGACCAAGAAAGGCGGAGTCrrCGGTATCTCGGGTGGCGTAGGGGTCGTACGGACGATAACAGAAGGGT 

1900 1910 1920 1930 1940 1950 1960 

ATCCTCCC'CCTTGCTGTCCTGCCCCACCCCACCCCCCAGAATAGAATGACACCTACTCAGACAATGCGAT 

1970 1980 1990 2000 2010 2020 2030 

GCAATTTCCTCATTTTATTAGGAAAGGACAGTGGGAGTGGCACCTTCCAGGGTCAAGGAAGGCACGGGGG 



AGGGGCAAACRACAGATGGCTGGCAACTAGAAGGCACAGTCGAGGCTGATCAGCGAGCTCTAGeGGTACC 
GTTGATCTTCCGTGTCAGCTCCGACTAGTCGCTCGAGATCGCCATGG 
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2390 2400 2410 2420 2430 2440 24S0 

AATCTTAGCGCa.GAAGTCyvTGCCCGCTTTrGAGAGGGAGTACTCACCCCAACAGCTGGCCCTCGCAGACA 

2450 2470 2430 2490 2S00 2510 2520 

GCGAATTAATTCCAGCACACTGGCGGCCGTTACTAGTGGATCCGAGCTCGCAAGCTAGCTTGGGTCTCCC 
CGCTTAATTAAGGTCGTGTGACCGCCGGCAATGATCACCTAGGCTCGAGCGrrCGATCGAACCCAGAGGG 
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CMCCATTTACCGTCATTGACGTCAATAGGGGGCGTACTTGGCATATGATACACTTGATGTACTGCCaAGT 
CCGGTAAATGGCAGTAACTGCAGTTATCCCCCGCATGAACCGTATACTATGTGAACTACATGACGGTTCA 



3300 3310 3320 3330 3340 33S0 3360 

AAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAG 
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CCCTGGAAGCTCCCTCGTGCGCrcrCCTGTTCCGACCCTQCCGCTTACCGGATACCTGTCCGCCTr^ 
GGGACCTTCGAGGGAGCACGCGAGAGGACAAGGCTGGGACGGCGAATGGCCrATGGACAGGCGGAAAGAG 



CCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCT 
GGAAGCCCTTCGCACCGCGAAAGAGTTACGAGTGCGACATCCATAGAGTCAAGCCACATCCAGCAAGCGA 



ICTGAAGCCAGTTACCTTCGaAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACA 



AACCACCGCTGGTAGCGGTGGTrrTTTTGTTTGCAaGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAA 
TTGGTGGCGACCATCGCCACCAAAAAAACAAACGTTCGTCGTCTAATGCGCGTCTTTTTTTCCTAGAGTT 

4000 4010 4020 4030 4040 4050 40SO 



:ttttgagtgcaattccctaaaacc 

4110 4120 4130 



AAACGTCrTGCTCGAGGCCGCGATTAAATTCCAACATGGATGCTGATTTATATGGGTATAAATGGGCrCG 
TTTGCAGAACGASSCTCCGGCGCrAATTTARGGTTGTACCTACGACTAAATATACCCATATTTACCCGAGC 
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CGATAATGTCCSGGCAATCaGGTGCGACAATCTATCGATTGTaTGGGRAGCCCGATGCGCCAGAGTTGTrT 
GCTATTACAGCCCGTTAGTCCACGCrGTTAGATAGCrfiACATACCCrrTCGGGCTACGCGGTCIX^ 
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GTAAACTACGAGCTACTOUVAAAGATTAGTCrTAACCAATTRACCAACATTGTGACCGTCT 
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CGCCnrATGTATAAAgrTACATAAATCTTXlTATTTGTrTATCCCCaAGGCGCGTgTARAGGCMC T 

5050 

GCCACCTGACGTC 
CGGTGGACTGCAG 
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GCTAaCGCCGCCa.CCATGGGAATGCAGGTGCAGATCCAGAGCCrGTTTCTGCTCCTCCTGTGGGTGCCCG 
CGATCaCGGCGGTGGTACCCrrACGTCCACGTCTAGGTCrCGGACAAAGACGAGGAGGACACCCACGGGC 
MGMQVQrQSI.FLLLLWVP> 

80 90 100 110 120 130 140 

GGTCCAGAGGAGACACCCTGrGGAAGGCCGGAATCCTGTATAAGGCaUlGTTCGTGGCrGCCrGGACCCT 

GSRGHTLWKAGILYKAKFVAAWTL> 

ISO 160 170 180 190 200 210 

GAAGGCTGCCGCTrrCCTGCCTAGCGATTrCTTTCCTAGCGTGAAGCTGACCCCACraTOCGTGA^ 
CTTCCGACGGCGAAAGGACGGATCGCTAAAGAAAGGATCGCACTTCGACTGGGGTGACACGCaCTGGGAC 
KAAAFL?SDPFPSVKLTPLCVTL> 



TATATGGATGACaTGGTGCr3GGAGCCAGCa.Tca.TaiACTrCGAGAAGCTGG<3ACTGTCCAGATACGTGG 
ATATACCTACTGCa.CCACGACCCTCGGTCGTAGTAGTTGAAGCTCTTCGACCCTGACAGGTCTATGCACC 
YMDDVVLGAS1INFEKLGLSRYV> 



CTAGGCTGATCCTGAAGGAGCCTGTGCACGGCGTGTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGAC 
GATCCGACTAGGACTTCCTCGGACACGTGCCGCACAGGTGGGACGGTCTCTGGTGGCACCACTCCTCCTG 
AR1,ILKE?VHGVSTLPETTVVRRT> 



CGTGTACTATGGAGTGCCTGTGTGGAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGGGTACC 
GCACATGATACCTCACGGACACACCTTCACCGACTCGGACGACCACGGGAAACACCCATGG 
VYYGVPVWKWLSLLVP FVGT> 



FIGDRE 20 
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CGATCGCGGCGGTGGTACCCTTACGTCCACGTCn:AGGTCrCGGACAJUlGACGAGGAGCiA.CACCCAC<Kj<^ 
MGMQVOIQSLFLI.LLWVP> 



GGTCCAGAGGAa^CACCCTG7GGAAGGCCGGAATCCTGTATAAGGCC:yU^GTTCGTGGCTQCCTGGACCCT 
CCAGGTCTCCTQTGTGGGACaCCTTCCGGCCTTAGGACaiTAXTCCGGTTCAAGCACCGACGGACCrGG^ 
GSRGHTLWKAGILYKAKFVAAWTL> 



GAAGGCTGCCGCTTrcCTGCCTAGCg A TTTCT l TCCTAGCGTGAAGCTGACCCCACTGTGCGTGACCCTQ 
CTTCCGACGGCGAAAGGACGGATCGCTAAAGAJUVGGATCGCACTTCGACTGGGGTGACACGCACrGGGAC 
KAA AFLPSDFrPSVKLTPriCVTL> 



TATATGGATGACGTGGTGCT3GGAGTGGGACTGTCCAGGTACGTGGCTAGGCTGATCCTGAAGGAGCCTG 
ATATACCTACTGCACCACGACCCTCACCCTGACAGGTCCATGCACCGATCCGACTAGGACTTCCTCGGAC 
YMDDVVLGVGLSRyVARLILKEP> 



TGCACGGCGTGTCCACCCTGCCAGAGACCACCGTGGTGAGGAGGACCGTGTACTATGGAGTGCCTGTGTG 
ACGTGCCGCACAGGTGGGACGGTCTCTGGTGGCACCACTCCTCCTGGCACATGATACCTCACGGACACAC 
VHGV S TLP ETTVVRRTVYYGVPV W> 



GAAGTGGCTGAGCCTGCTGGTGCCCTTTGTGTGAGGTACC 
CTTCACCGACTCGGACGACCACGGGAAACACACTCCATGG 
KWLSLLVPFV*> 



FIGURE 21 
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